Oblivious resampling oracles and parallel algorithms for the Lopsided Lovász Local Lemma

David G. Harris

arXiv:1702.02547·cs.DS·October 13, 2023


TL;DR

This paper introduces the concept of obliviousness in resampling oracles for the Lopsided Lovász Local Lemma, enabling faster parallel algorithms and new resampling oracles for complex combinatorial structures.

Contribution

It identifies the obliviousness property in resampling oracles, leading to a unified parallel LLLL algorithm and new resampling oracles for rainbow perfect matchings and Hamiltonian cycles.

Findings

01. Developed a faster parallel LLLL algorithm using obliviousness.

02. First RNC algorithms for rainbow perfect matchings and Hamiltonian cycles.

03. Constructed new sequential and commutative resampling oracles for complex structures.

Abstract

The Lovász Local Lemma (LLL) is a probabilistic tool which shows that, if a collection of "bad" events $B$ in a probability space are not too likely and not too interdependent, then there is a positive probability that no bad-events in $B$ occur. Moser & Tardos (2010) gave sequential and parallel algorithms which transformed most applications of the variable-assignment LLL into efficient algorithms. A framework of Harvey & Vondrák (2015) based on "resampling oracles" extended this to general sequential algorithms for other probability spaces satisfying the Lopsided Lovász Local Lemma (LLLL). We describe a new structural property which holds for all known resampling oracles, which we call "obliviousness." Essentially, it means that the interaction between two bad-events $B, B^{\prime}$ depends only on the randomness used to resample $B$, and not the precise state…




Full text

Oblivious resampling oracles and parallel algorithms for the Lopsided Lovász Local Lemma

David G. Harris Department of Computer Science, University of Maryland, College Park, MD 20742. Email: [email protected]

Abstract

The Lovász Local Lemma (LLL) shows that, for a collection of “bad” events $\mathcal{B}$ in a probability space which are not too likely and not too interdependent, there is a positive probability that no events in $\mathcal{B}$ occur. Moser & Tardos (2010) gave sequential and parallel algorithms which transformed most applications of the variable-assignment LLL into efficient algorithms. A framework of Harvey & Vondrák (2015) based on “resampling oracles” extended this to sequential algorithms for other probability spaces satisfying a generalization of the LLL known as the Lopsided Lovász Local Lemma (LLLL).

We describe a new structural property which holds for most known resampling oracles, which we call “obliviousness.” Essentially, it means that the interaction between two bad-events $B,B^{\prime}$ depends only on the randomness used to resample $B$ , and not the precise state within $B$ itself.

This property has two major consequences. First, combined with a framework of Kolmogorov (2016), it leads to a unified parallel LLLL algorithm, which is faster than previous, problem-specific algorithms of Harris (2016) for the variable-assignment LLLL and of Harris & Srinivasan (2014) for permutations. This gives the first RNC algorithms for rainbow perfect matchings and rainbow hamiltonian cycles of $K_{n}$ .

Second, this property allows us to build LLLL probability spaces from simpler “atomic” events. This gives the first resampling oracle for rainbow perfect matchings on the complete $s$ -uniform hypergraph $K_{n}^{(s)}$ , and the first commutative resampling oracle for hamiltonian cycles of $K_{n}$ .

This is an extended version of a paper which appeared in the ACM-SIAM Symposium on Discrete Algorithms (SODA) 2019.

1 The Lovász Local Lemma and its algorithms

The Lovász Local Lemma (LLL) is a fundamental probabilistic tool. It shows that, for a probability space $\Omega$ with a finite set $\mathcal{B}$ of $m$ “bad” events, as long as the bad-events are not too interdependent (in a certain technical sense) and are not too likely, there is a positive probability that no events in $\mathcal{B}$ occur. The simplest form of the LLL, known as the symmetric LLL, can be stated as follows: if every bad-event $B$ has $\Pr_{\Omega}(B)\leq p$ and is dependent with at most $d$ others, where $epd<1$, then there is a positive probability that none of the bad-events occur.

Most combinatorial applications of the LLL use a relatively simple probability space, which we call the variable-assignment LLL. This setting has $n$ independent variables $X_{1},\dots,X_{n}$ , and each bad-event $B$ is a boolean function of a subset of these variables denoted $\operatorname{var}(B)$ . Bad-events $B,B^{\prime}$ are dependent (written $B\sim B^{\prime}$ ) iff $\operatorname{var}(B)\cap\operatorname{var}(B^{\prime})\neq\emptyset$ . Moser & Tardos [35] introduced a remarkably simple algorithm for this setting, which we refer to as the MT algorithm:
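A minimal sketch of this resampling loop, as the MT algorithm is usually stated: draw every variable independently, and while some bad-event is true, pick one and redraw only the variables in $\operatorname{var}(B)$. The helper names (`clause_event`, `moser_tardos`), the uniform binary domains, and the toy clauses are assumptions made for illustration.

```python
import random

def clause_event(literals):
    """Bad-event for a clause: every literal (var, wanted_value) is falsified."""
    variables = [v for v, _ in literals]
    return variables, lambda X: all(X[v] != want for v, want in literals)

def moser_tardos(domains, bad_events, rng):
    """domains: var -> list of values (uniform sampling is an assumption).
    bad_events: list of (var(B), predicate) pairs."""
    # Draw each variable independently from its distribution.
    X = {v: rng.choice(vals) for v, vals in domains.items()}
    while True:
        # Naive bad-event check: scan every bad-event on the current X.
        true_events = [ev for ev in bad_events if ev[1](X)]
        if not true_events:
            return X
        variables, _ = true_events[0]  # any true bad-event may be chosen
        # Resample only the variables the chosen bad-event depends on.
        for v in variables:
            X[v] = rng.choice(domains[v])

# Toy 3-SAT instance (hypothetical): a literal (v, 1) means "X_v = 1".
domains = {v: [0, 1] for v in range(6)}
clauses = [
    [(0, 1), (1, 1), (2, 1)],
    [(2, 0), (3, 1), (4, 1)],
    [(0, 0), (3, 0), (5, 1)],
]
bad_events = [clause_event(c) for c in clauses]
assignment = moser_tardos(domains, bad_events, random.Random(0))
```

The scan over all bad-events in each iteration is the simplest possible choice and costs time on the order of $m$ per step.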

Moser & Tardos [35] showed that this algorithm terminates quickly whenever the symmetric LLL criterion (or a more general asymmetric LLL criterion) is satisfied. Later work [36, 28, 18] showed that it terminates under more general criteria. See Appendix A for background on the LLL and MT algorithm.

Note that the MT algorithm requires a subroutine to find a bad-event $B$ which is true on the current configuration $X$ (if any). We refer to this as a Bad-Event Checker (BEC). The simplest implementation of this is to loop over all bad-events and test them one by one, which would have a run-time on the order of $m$. The run-time of the MT algorithm can often be polynomial in $n$ and independent of $m$ if a more-efficient BEC is used [17, 21].

1.1 The Lopsided Lovász Local Lemma

In [10], Erdős & Spencer noted that positive correlation among bad-events (again, in a certain technical sense) is as good as independence for the LLL. This generalization has been referred to as the Lopsided Lovász Local Lemma (LLLL). We say $B,B^{\prime}$ are lopsidependent and write $B\sim B^{\prime}$ if $B,B^{\prime}$ are neither independent nor positively correlated in this sense. (Formal definitions are provided later in Section 2.)

Although the variable-assignment LLL covers the vast majority of applications in combinatorics, the LLLL is also used occasionally. For example, the original application of the LLLL used a probability space on permutations to construct Latin transversals for certain types of arrays [10]. Other applications include hamiltonian cycles on $K_{n}$ [4], perfect matchings of $K_{n}$ [32], perfect matchings of the complete $s$ -uniform hypergraph $K_{n}^{(s)}$ [30], and spanning trees of $K_{n}$ [30].

The variable-assignment setting provides one of the simplest forms of the LLLL. Here, as before, there are independent variables $X_{1},\dots,X_{n}$ . Instead of allowing arbitrary boolean functions of the variables, each bad-event should be a monomial function, i.e. of the form

$B\equiv X_{i_{1}}=j_{1}\wedge\dots\wedge X_{i_{k}}=j_{k}$

For the LLL, we would have $B\sim B^{\prime}$ if the bad-events $B$ and $B^{\prime}$ share some common variable, i.e. $i_{t}=i^{\prime}_{t^{\prime}}$ for some $t,t^{\prime}$ (writing $i^{\prime}_{t^{\prime}},j^{\prime}_{t^{\prime}}$ for the indices and values appearing in $B^{\prime}$). For the LLLL, the (lopsi)dependency relation is more restricted: we have $B\sim B^{\prime}$ if $B$ and $B^{\prime}$ disagree on some common variable, i.e. $i_{t}=i^{\prime}_{t^{\prime}}$ and $j_{t}\neq j^{\prime}_{t^{\prime}}$.
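The difference between the two relations can be checked mechanically. In the sketch below, a monomial bad-event is encoded as a dictionary mapping a variable index $i_t$ to its required value $j_t$; this encoding and the function names are assumptions for illustration.

```python
def shares_variable(B1, B2):
    """LLL dependency: the two monomials mention a common variable."""
    return any(i in B2 for i in B1)

def lopsidependent(B1, B2):
    """LLLL dependency: the two monomials disagree on some common variable."""
    return any(i in B2 and B2[i] != j for i, j in B1.items())

B = {1: 0, 2: 0}        # X_1 = 0 and X_2 = 0
B_agree = {2: 0, 3: 1}  # shares X_2 with B and demands the same value
B_clash = {2: 1, 3: 1}  # shares X_2 with B but demands a different value

print(shares_variable(B, B_agree), lopsidependent(B, B_agree))
print(shares_variable(B, B_clash), lopsidependent(B, B_clash))
```

Here `B` and `B_agree` are dependent in the LLL sense but not lopsidependent, so the LLLL counts fewer neighbors for each bad-event.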

Moser & Tardos showed that their algorithm applies to the variable-assignment LLLL setting. In [22], Harris & Srinivasan developed an algorithm similar to the MT algorithm for the probability space of random permutations, which includes the Latin transversal application of [10]. Extending these problem-specific algorithms, Harvey & Vondrák [25] developed a general framework based on a “resampling oracle” $\mathfrak{R}$ for the probability space. We will define this formally in Section 2, but intuitively, this is a randomized algorithm which, given some state $u$ with some bad-event $B$ true on $u$, attempts to “rerandomize” the configuration in a “local” way to fix $B$. This is similar to the way that the MT algorithm resamples the variables involved in $B$. Given this resampling oracle, the following Algorithm 2 can be used to find a configuration avoiding the bad-events:
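A minimal sketch of such a generic resampling loop, assuming the smallest-index selection rule. The example instantiates it on random permutations with atomic bad-events of the form $\pi(x)=y$. The swap rule in `swap_oracle` is a simple choice made for illustration and is not claimed to be the oracle constructed later in the paper; all function names are hypothetical.

```python
import random

def run_resampling(sample_state, bad_events, holds, oracle, rng):
    """Draw u from the probability space, then repeatedly apply the
    resampling oracle to the first bad-event that is true on u."""
    u = sample_state(rng)
    while True:
        true_now = [B for B in bad_events if holds(B, u)]
        if not true_now:
            return u
        u = oracle(true_now[0], u, rng)  # smallest-index rule

def sample_perm(n):
    def draw(rng):
        pi = list(range(n))
        rng.shuffle(pi)
        return pi
    return draw

def holds(B, pi):
    x, y = B  # the atomic event pi(x) = y
    return pi[x] == y

def swap_oracle(B, pi, rng):
    """Illustrative local rerandomization: swap pi(x) with pi(z), z uniform."""
    x, _ = B
    z = rng.randrange(len(pi))
    pi = list(pi)
    pi[x], pi[z] = pi[z], pi[x]
    return pi

n = 5
bad_events = [(i, i) for i in range(n)]  # forbid every fixed point
pi = run_resampling(sample_perm(n), bad_events, holds, swap_oracle,
                    random.Random(1))
```

The loop itself never inspects how the oracle works, which is what lets the same driver serve matchings, spanning trees, and other spaces once an oracle is supplied.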

These results have led to constructive counterparts to combinatorial results involving spanning trees and matchings of $K_{n}$ (both discussed in [25]) and hamiltonian cycles of $K_{n}$ (subsequently developed in [24]). A further line of research has extended Algorithm 2, and variants, to other spaces which do not directly correspond to the LLLL [1, 2, 3, 23].

We note that the choice of which bad-event to select in line (3) of Algorithm 2 is much more constrained than for the MT algorithm. Only a limited number of possibilities work in general, such as selecting $B$ with smallest index, whereas the MT algorithm allows nearly complete freedom. In [29], Kolmogorov showed that a number of resampling oracles (including variable-assignment, permutations, and perfect matchings of $K_{n}$) satisfy an additional property known as commutativity. In such cases, Algorithm 2 also allows an arbitrary choice of which bad-event to select. Kolmogorov [29] and Iliopoulos [27] further showed that this property has powerful algorithmic consequences, including parallel algorithms, efficient BECs, and bounds on the output distribution at the termination of Algorithm 2.

1.2 Parallel algorithms

Moser & Tardos also presented a simple parallel version of their resampling algorithm. This parallel algorithm requires a slightly stronger criterion, which we refer to as $\epsilon$-slack; for instance, the symmetric LLL requires $ep(1+\epsilon)d\leq 1$. If this is satisfied, then it terminates after $O(\frac{\log m}{\epsilon})$ rounds with high probability. (We say that an event occurs with high probability, abbreviated whp, if it has probability at least $1-n^{-\Omega(1)}$.) On an EREW PRAM, it has overall runtime $O(\frac{\log^{3}m}{\epsilon})$. We summarize the algorithm as follows:
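A minimal sequential simulation of the parallel MT rounds, as the algorithm is usually stated: in each round, take a maximal independent set of the currently-true bad-events (no two sharing a variable) and resample all of their variables at once. The greedy independent-set step below stands in for a genuinely parallel MIS routine, and the names and toy instance are assumptions.

```python
import random

def clause_event(literals):
    """Bad-event for a clause: every literal (var, wanted_value) is falsified."""
    variables = [v for v, _ in literals]
    return variables, lambda X: all(X[v] != want for v, want in literals)

def parallel_mt(domains, bad_events, rng):
    """Returns (assignment, number_of_rounds)."""
    X = {v: rng.choice(vals) for v, vals in domains.items()}
    rounds = 0
    while True:
        true_events = [ev for ev in bad_events if ev[1](X)]
        if not true_events:
            return X, rounds
        # Greedy maximal independent set: chosen events share no variable.
        used = set()
        for variables, _ in true_events:
            if used.isdisjoint(variables):
                used.update(variables)
        # The chosen events touch disjoint variables, so their resamplings
        # do not interfere; a PRAM would perform them simultaneously.
        for v in used:
            X[v] = rng.choice(domains[v])
        rounds += 1

domains = {v: [0, 1] for v in range(6)}
clauses = [
    [(0, 1), (1, 1), (2, 1)],
    [(2, 0), (3, 1), (4, 1)],
    [(0, 0), (3, 0), (5, 1)],
]
bad_events = [clause_event(c) for c in clauses]
assignment, rounds = parallel_mt(domains, bad_events, random.Random(0))
```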

Haeupler & Harris [16] showed that the parallel MT algorithm could be implemented in time $O(\frac{\log^{3}n}{\epsilon})$ (avoiding dependence on $m$ ) and gave an alternative parallel algorithm in time $O(\frac{\log^{2}m}{\epsilon})$ . The parallel MT algorithm can also usually be implemented even for more general LLL criteria, including the asymmetric LLL and Shearer’s LLL criterion [28].

(In some computational models, multiple processors can write to a memory cell simultaneously and the runtimes can often be reduced by logarithmic factors. For simplicity, we will be conservative and use only the EREW PRAM model throughout this paper. We say that an algorithm is in $RNC^{k}$ if it runs in $\tilde{O}(\log^{k}n)$ time using $\operatorname{poly}(n)$ processors whp on an EREW PRAM.)

The parallel MT algorithm leads in a straightforward way to distributed graph algorithms in $O(\frac{\log^{2}m}{\epsilon})$ communication rounds. There has been extensive research into obtaining faster distributed and parallel LLL algorithms; some of these algorithms require significantly stronger (but still local) conditions on the dependency $d$ and probability $p$ of the bad-events [8, 11, 13]. Brandt et al. [7] showed that generic distributed LLL algorithms require $\Omega(\log\log n)$ rounds.

Frustratingly, although the sequential MT algorithm works for the variable-assignment LLLL just as it does for the variable-assignment LLL, this is not true of the parallel MT algorithm. There have been only a handful of parallel algorithms for the LLLL, such as the variable-assignment LLLL algorithm of Harris [18] and the permutation LLL algorithm of Harris & Srinivasan [22].

In [29] Kolmogorov proposed a general framework for constructing parallel LLLL algorithms via resampling oracles, which can be summarized as follows:

Each iteration of the loop of lines (3) to (7) is called a round. Kolmogorov showed that, when the resampling oracle $\mathfrak{R}$ is commutative, then Algorithm 4 terminates whp after $O(\log n)$ rounds. We emphasize this is a sequential algorithm, which is in fact a version of Algorithm 2.
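The round structure can be sketched as follows, under an assumed reading of the framework: a round starts from the set of currently-true bad-events, repeatedly resamples one of them, and then discards from the working set every bad-event that has become false or that is dependent on the one just resampled. The sketch uses the variable-assignment LLLL with binary variables; monomials are tuples of (variable, value) pairs, and all names are hypothetical.

```python
import random

def holds(B, u):
    return all(u[i] == j for i, j in B)

def lopsidependent(A, B):
    """A and B disagree on some common variable."""
    b = dict(B)
    return any(i in b and b[i] != j for i, j in A)

def resample(B, u, rng):
    """Redraw the variables of B uniformly from {0, 1}."""
    v = dict(u)
    for i, _ in B:
        v[i] = rng.randrange(2)
    return v

def run_rounds(n, bad_events, rng):
    u = {i: rng.randrange(2) for i in range(n)}
    rounds = 0
    while any(holds(B, u) for B in bad_events):
        V = [B for B in bad_events if holds(B, u)]  # true at round start
        while V:
            B = V[0]  # arbitrary choice
            u = resample(B, u, rng)
            # Drop B itself, anything now false, and anything dependent on B.
            V = [A for A in V
                 if A != B and holds(A, u) and not lopsidependent(A, B)]
        rounds += 1
    return u, rounds

bad_events = [((0, 0), (1, 0)), ((1, 0), (2, 0)), ((2, 1), (3, 1))]
state, rounds = run_rounds(4, bad_events, random.Random(0))
```

The first two bad-events agree on $X_1$, so they are not lopsidependent, yet resampling the first may make the second false. The `holds(A, u)` filter is where that case is handled.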

If a single round can be simulated in polylogarithmic time, then this yields an RNC algorithm. In almost every setting where a parallel LLLL algorithm is known (including all the ones in this paper), the resampling oracle is commutative and the parallel algorithm is an implementation of Kolmogorov’s framework.

This makes partial progress toward a general parallel LLLL algorithm; however, there remain two significant hurdles. The most straightforward of these is a parallel implementation of $\mathfrak{R}$. This is trivial for the variable-assignment LLL: if bad-events $B,B^{\prime}$ are both selected for resampling, then $\operatorname{var}(B)$ and $\operatorname{var}(B^{\prime})$ must be disjoint and the resamplings can be executed simultaneously. For other probability spaces, it is not clear how to resample without “locking” the state.

The second and much more fundamental hurdle is that the LLLL resampling process is inherently sequential in a way that the LLL is not. For the LLLL (but not the LLL) it is possible that two bad-events $B,B^{\prime}$ are currently true, and $B\not\sim B^{\prime}$ , and resampling $B$ makes $B^{\prime}$ false. We say in this case that $B$ fixes $B^{\prime}$ . Because of this possibility, $B$ and $B^{\prime}$ cannot be resampled simultaneously; one must select (arbitrarily) one of the two bad-events to resample first, and then only resample the second one if it still remains true. One critical challenge for LLLL algorithms is to simulate in parallel the process of resampling the bad-events in sequence.

The parallel LLLL algorithms of Harris [18] and Harris & Srinivasan [22] overcome these hurdles to a limited extent. However, they still suffer from a number of shortcomings. Although they run in polylogarithmic time, the exponent is quite high (and is not computed explicitly). They also require additional structure, such as having bad-events which involve a polylogarithmic number of variables. Finally, and perhaps most seriously, these algorithms are highly tailored to a single probability space. They are reminiscent of the situation for LLL algorithms before the framework of Harvey & Vondrák [25]: specialized algorithms with ad-hoc analysis.

1.3 Our contribution and overview

We identify a new property of resampling oracles that we refer to as obliviousness. To summarize, suppose we have two bad-events $B,B^{\prime}$ with $B\not\sim B^{\prime}$ , and a state $u\in B\cap B^{\prime}$ . The obliviousness property states that whether $B$ fixes $B^{\prime}$ depends solely on the randomness used to resample $B$ , and not on the state $u$ itself. This framework is developed in Section 2. We find it remarkable that so many LLLL probability spaces, even the non-commutative ones, have oblivious resampling oracles: this includes variable-assignment, permutations, perfect matchings of $K_{n}$ , perfect matchings of the hypergraph $K_{n}^{(s)}$ , hamiltonian cycles of $K_{n}$ , and spanning trees of $K_{n}$ .

A unified parallel algorithm. Obliviousness allows us to sidestep the second major hurdle to a parallel LLLL algorithm. It reduces the possibility of $B$ fixing $B^{\prime}$ to a pairwise phenomenon: we only need to know the resampling action chosen for $B$ , not the present state (which may be changing during other resampling actions). The space of sequential resamplings can thus be represented in a simple graph structure, allowing us to efficiently find a valid sequence.

To implement this sequence in parallel, we encode $\mathfrak{R}$ as a monoid action. Specifically, $\mathfrak{R}_{B}$ can be interpreted as a randomly-chosen monoid element $r_{B}$ acting on the current state $u$ . In this way, resampling multiple bad-events $B_{1},\dots,B_{s}$ can be interpreted algebraically as the product $r_{B_{s}}\dots r_{B_{1}}u$ . This is easily parallelized by the associativity of monoidal multiplication.
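To illustrate why associativity helps, the sketch below multiplies a sequence of monoid elements by balanced pairwise merging and compares the result with left-to-right application. Elements are modeled as maps on $\{0,\dots,n-1\}$ stored as tuples, composed with the convention that $(fg)(x)=f(g(x))$; this concrete monoid and the function names are assumptions for illustration.

```python
def compose(f, g):
    """(f g)(x) = f(g(x)): g acts first, then f."""
    return tuple(f[g[x]] for x in range(len(g)))

def sequential_product(elems):
    """r_s ... r_1 computed one factor at a time (s dependent steps)."""
    acc = tuple(range(len(elems[0])))
    for r in elems:
        acc = compose(r, acc)
    return acc

def tree_product(elems):
    """Same product by pairwise merging. The merges within one level are
    independent, so with enough processors the depth is O(log s)."""
    level = list(elems)
    while len(level) > 1:
        nxt = []
        for i in range(0, len(level) - 1, 2):
            # The later element acts after the earlier one: it goes on the left.
            nxt.append(compose(level[i + 1], level[i]))
        if len(level) % 2:
            nxt.append(level[-1])  # odd element carries over, order preserved
        level = nxt
    return level[0]

# Three transpositions of {0,1,2,3}: (0 1), (1 2), (2 3), applied in order.
r1, r2, r3 = (1, 0, 2, 3), (0, 2, 1, 3), (0, 1, 3, 2)
print(sequential_product([r1, r2, r3]))  # (3, 0, 1, 2)
print(tree_product([r1, r2, r3]))        # (3, 0, 1, 2)
```

In this simulation the merges still run one after another; only the dependency depth is logarithmic.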

We summarize our generic parallel LLLL algorithm as follows:

Theorem 1.1 (Informal).

Suppose that $epd(1+\epsilon)\leq 1$ holds for any LLLL probability space with an appropriate parallelizable resampling oracle. Then there is a parallel algorithm in time $O(\frac{\log^{4}n}{\epsilon})$ to find a state avoiding $\mathcal{B}$ .

We summarize some notable applications of this algorithm.

1. Suppose we have a $k$-SAT instance on $n$ variables and $m$ clauses, in which each variable appears in at most $L\leq\frac{2^{k+1}(1-1/k)^{k}}{(k-1)(1+\epsilon)}-\frac{2}{k}$ clauses. There is an $RNC^{4}$ algorithm to find a satisfying assignment.

2. For an integer $c\geq 2$, suppose that $H$ is a $k$-uniform hypergraph where each vertex appears in at most $L=\frac{c^{k}(1-1/k)^{k-1}}{k(c-1)(1+\epsilon)}$ edges. There is a randomized algorithm in $O(\frac{\log^{3}n}{\epsilon})$ rounds for the LOCAL distributed computing model to find a proper vertex $c$-coloring of $H$.

3. Suppose that $A$ is an $n\times n$ matrix whose entries are labeled by colors and each color appears in at most $\Delta$ entries. For $\Delta\leq 0.105n$, there is an $RNC^{4}$ algorithm to find a Latin transversal of $A$. For $\Delta\leq n\Bigl(\frac{(s-1)!}{2e(1+\epsilon)s}\Bigr)^{1/(s-1)}$ there is an $RNC^{4}$ algorithm to find a transversal of $A$ where each color appears at most $s$ times.

4. Suppose that we have an edge-coloring of $K_{n}$ where each color appears on at most $\Delta$ edges. If $\Delta\leq 0.105n$ and $n$ is even, there is an $RNC^{4}$ algorithm to find a rainbow perfect matching. If $\Delta\leq 0.026n$, there is an $RNC^{4}$ algorithm to find a rainbow hamiltonian cycle.

Versions of the first two results with slightly worse parameters can be derived from the variable-assignment LLL and parallel MT algorithm. Previous slower RNC algorithms are known for the third result. We are not aware of any RNC algorithms comparable with the fourth result; this answers open problems posed by Kolmogorov [29] and Harvey & Liaw [24].

A new resampling framework. Beyond its direct algorithmic impact, obliviousness can simplify a number of resampling oracle constructions. Most LLLL probability spaces come from a set of relatively simple “atomic events.” For example, in the space of uniform permutations, these are events of the form $\pi(x)=y$ . A bad-event $B$ is then taken to be a conjunction of atomic events.

It is intuitively clear that the resampling oracle for the atomic events in some sense “generates” the resampling oracle for $\mathcal{B}$ . A formal description of this has been elusive. To illustrate the difficulty, consider a bad-event $B=A_{1}\cap A_{2}$ and a configuration $u\in B$ , where $A_{1},A_{2}$ are atomic events. We would like to resample $B$ by resampling $A_{1}$ and then resampling $A_{2}$ . In order to obtain the correct probability distribution, we must condition on $A_{2}$ remaining true after resampling $A_{1}$ . For a general resampling oracle, this conditioning step might distort the probability distribution of $u$ in an unmanageable way. But for an oblivious resampling oracle, we are guaranteed that conditioning on $A_{2}$ remaining true retains an independent, uniform distribution for $u$ itself.

We derive a simple list of axioms required for an oblivious resampling oracle for the atomic events only; these automatically lead to a resampling oracle for $\mathcal{B}$ . Beyond the fact that this gives new algorithmic results, this greatly simplifies many proofs and constructions for existing resampling oracles. We highlight a few results:

1. We get a commutative resampling oracle, and parallel algorithms, for the space of hamiltonian cycles of $K_{n}$.

2. We get a resampling oracle for the space of perfect matchings of the complete hypergraph $K_{n}^{(s)}$. This leads to efficient (sequential) algorithms corresponding to non-constructive results on rainbow hypergraph matchings shown by Lu, Mohr, & Székely [30].

1.4 Outline

In Section 2, we formally define the LLLL in terms of resampling oracles. We provide a new framework which is more algebraic compared to the probabilistic formulation originally developed in [25]. We define the properties needed for resampling oracles, including commutativity and the new property of obliviousness. We also discuss the method for generating LLLL-compatible probability spaces from atomic events.

In Section 3, we describe a new graph algorithm needed for our parallel LLLL algorithm. This computes a structure which is similar to a lexicographically-first MIS (LFMIS), but generalized to directed graphs. This plays a similar role to the MIS in the parallel MT algorithm, but respects the sequential ordering of the bad-events. We show that, for a random vertex order, this LFMIS can be computed efficiently in $O(\log^{2}n)$ rounds by a simple greedy parallel algorithm adapted from Blelloch, Fineman & Shun [6] for undirected graphs. This is a pure graph theory problem which does not directly involve the LLLL, and may be of independent interest.

In Section 4, we describe our generic LLLL algorithm in terms of a resampling oracle from the framework of Section 2.

In Section 5, we analyze the variable-assignment LLLL. We show how the simple resampling oracle (which is just to resample variables from the original distribution) fits into the formal framework of Section 2. We provide a few example applications, to $k$ -SAT and hypergraph coloring.

In Section 6, we describe a few other more “exotic” LLLL spaces, including random permutations, hamiltonian cycles, and perfect matchings. We discuss a few applications, including to strong coloring and a number of Latin transversal problems.

1.5 Notation

Throughout, we let $[n]$ denote the set $\{1,\dots,n\}$ . For a probability space $\Omega$ over a ground set $U$ , we say that $u\approx\Omega$ if $u$ is a random variable drawn according to distribution $\Omega$ . We define $\Omega[u]$ to be the probability mass of $u$ , and we define $\text{Support}(\Omega)$ to be the set of values $u\in U$ with $\Omega[u]>0$ .

For any $V\subseteq U$ we define $\Omega[V]=\Pr_{u\approx\Omega}(u\in V)=\sum_{v\in V}\Omega[v]$ . We also define $\Omega|V$ to be the conditional distribution on $V$ , i.e. $(\Omega|V)[v]=\Omega[v]/\Omega[V]$ for $v\in V$ .

For two random variables $X,Y$ , we say $X\approx Y$ if $X,Y$ follow the same distribution. For any set $X$ , we define $\text{Unif}(X)$ to be the uniform distribution on $X$ .

For $s\geq 2$ , we let $K_{n}^{(s)}$ denote the complete $s$ -uniform hypergraph on vertex set $[n]$ . For $s=2$ (the complete graph), we also write $K_{n}=K_{n}^{(2)}$ . We say that $M$ is a perfect matching of $K_{n}^{(s)}$ if it is a partition of $[n]$ into exactly $n/s$ classes of size $s$ . Whenever we refer to the set of perfect matchings of $K_{n}^{(s)}$ , we will assume implicitly that $s$ divides $n$ .

We define $S_{n}$ to be the symmetric group on $n$ letters, viewed concretely as the set of bijections on ground set $[n]$ . We write $(a\ b)$ for the transposition swapping $a$ and $b$ . We also write $\sigma_{1}\sigma_{2}$ for the functional composition $\sigma_{1}\circ\sigma_{2}$ , that is, the function sending $x$ to $\sigma_{1}(\sigma_{2}(x))$ .

For subsets $A,B$ of an algebraic structure $G$ , we let $AB$ denote the product set $AB=\{ab\mid a\in A,b\in B\}$ . Similarly, for $b\in G,A\subseteq G$ we write $bA=\{ba\mid a\in A\}$ and $Ab=\{ab\mid a\in A\}$ .

For a directed graph $G=(V,E)$ and a vertex $v\in V$ , we define the out-neighborhood $N^{\text{out}}(v)=\{w\mid(v,w)\in E\}$ and the out-degree of $v$ is the cardinality of this set. Similarly we define the in-neighborhood $N^{\text{in}}(v)=\{w\mid(w,v)\in E\}$ , and the in-degree of $v$ is the cardinality of this set.

2 The LLLL and resampling oracles

In this section, we will formally define the LLLL and how to construct a resampling oracle for it, in the sense of Harvey & Vondrák [25]. We note that Erdős & Spencer [10] describe an alternate, probabilistic interpretation of the LLLL, which is slightly more general. Since this is technical to describe and we will never use this interpretation, we will not discuss it here.

Constructions based on the LLLL typically have two phases. First, we choose a large collection of highly-structured “generic” bad-events in a probability space, equipped with an appropriate lopsidependency relation and a resampling oracle. For example, in the variable-assignment LLLL setting, the underlying probability space is a cartesian product space with $n$ independent variables and the generic bad-events are the monomial functions of the form $X_{i_{1}}=j_{1}\wedge\dots\wedge X_{i_{k}}=j_{k}$ for arbitrary values $k,(i_{1},j_{1}),\dots,(i_{k},j_{k})$ . For the permutation setting, the underlying probability space is the uniform distribution on $S_{n}$ and the generic bad-events have the form $\pi(x_{1})=y_{1}\wedge\dots\wedge\pi(x_{k})=y_{k}$ for arbitrary values $k,(x_{1},y_{1}),\dots,(x_{k},y_{k})$ .

It is impossible to avoid all the generic bad-events. The second phase of the LLLL is to select some problem-specific, more-or-less “random”, subset of the generic bad-events. For example, if we wish to satisfy a given $k$ -SAT formula, then for each clause $X_{i_{1}}=j_{1}\vee\dots\vee X_{i_{k}}=j_{k}$ , we would have in $\mathcal{B}$ the bad-event $X_{i_{1}}=1-j_{1}\wedge\dots\wedge X_{i_{k}}=1-j_{k}$ , which is one of the generic bad-events.

In order to show that the LLLL applies, and that Algorithm 2 converges to an assignment avoiding $\mathcal{B}$ , we must show two things. First, that the resampling oracle works properly on the generic set of bad-events containing $\mathcal{B}$ . Second, that the specific chosen subset $\mathcal{B}$ has its probabilities and dependencies sufficiently small; for example, each bad-event $B\in\mathcal{B}$ has $\Pr_{\Omega}(B)\leq p$ and is lopsidependent with at most $d$ other bad-events of $\mathcal{B}$ , where $epd<1$ .

These two phases are almost completely distinct. The first is highly algebraic, while the second is more combinatorial. In this section, we will only discuss the first phase of constructing the generic set of bad-events to be compatible with the LLLL. The second phase, for which we use only standard techniques, is discussed in Appendix A.

2.1 Framework for resampling oracles

Consider a probability space $\Omega$ over a ground set $U$ , along with a collection $\mathcal{B}$ of events in that space. There is also a binary symmetric relation $\sim$ provided for $\mathcal{B}$ , which we refer to as the dependency relation. (Footnote 2: More properly, this should be referred to as a “lopsidependency” relation. The distinction between dependency and lopsidependency is not important for us, so we use the simpler terminology.) We will define the properties needed for a resampling oracle $\mathfrak{R}$ for this space, in the sense of Algorithm 2, along with the new property “obliviousness” which we will need for our algorithms. We will later construct a number of such resampling oracles.

We will define $\mathfrak{R}$ by specifying a monoid $R$ which acts on $U$ . We refer to the $R$ -act on $U$ as the resampling action, and we write it as $ru$ for $r\in R,u\in U$ . We also define, for each $B\in\mathcal{B}$ , a probability distribution $\Gamma_{B}$ over $R$ and we define $R_{B}=\text{Support}(\Gamma_{B})\subseteq R$ . The intent is to define the resampling oracle $\mathfrak{R}_{B}$ as $\mathfrak{R}_{B}(u)=ru$ where $r\approx\Gamma_{B}$ . Note that it is very important for us to separate the role of the randomness used in $\mathfrak{R}_{B}$ .

Before we define our new obliviousness property, let us reiterate the conditions of Harvey & Vondrák [25] and Kolmogorov [29], in terms of our notation. (Footnote 3: Kolmogorov [29] refers to property (C3) here as “strong commutativity.” We will never use the weaker commutativity properties defined by Kolmogorov, so we just refer to this as commutativity for convenience.)

(C1)

(Probability regeneration) For any $B\in\mathcal{B}$ and any fixed $v\in U$ , we have

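$$\Pr_{(u,r)\approx(\Omega|B)\times\Gamma_{B}}(ru=v)=\Omega[v]$$

(C2)

(Locality) If $B\not\sim B^{\prime}$ , and $u\in B-B^{\prime}$ , then for all $r\in R_{B}$ we have $ru\notin B^{\prime}$ .

(C3)

(Commutativity) Let $B_{1}\not\sim B_{2}$ . For any states $u\in B_{1}\cap B_{2}$ and $u^{\prime}\in U$ , there is an injective mapping from states $w\in B_{2}\cap R_{B_{1}}u$ with $u^{\prime}\in R_{B_{2}}w$ , to states $w^{\prime}\in B_{1}\cap R_{B_{2}}u$ with $u^{\prime}\in R_{B_{1}}w^{\prime}$ , such that

$$\Pr_{r_{1}\approx\Gamma_{B_{1}}}(r_{1}u=w)\Pr_{r_{2}\approx\Gamma_{B_{2}}}(r_{2}w=u^{\prime})=\Pr_{r_{2}\approx\Gamma_{B_{2}}}(r_{2}u=w^{\prime})\Pr_{r_{1}\approx\Gamma_{B_{1}}}(r_{1}w^{\prime}=u^{\prime})$$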

Observation 2.1.

If Properties (C1) and (C2) are satisfied, then the randomized function $\mathfrak{R}_{B}$ defined by choosing $r\approx\Gamma_{B}$ and outputting $\mathfrak{R}_{B}(u)=ru$ , gives a resampling oracle in the sense of Harvey & Vondrák [25]. If (C3) is also satisfied, then the resampling oracle $\mathfrak{R}_{B}$ is commutative in the sense of Kolmogorov [29].

We define a resampling-space to be an ensemble of such objects $\mathcal{B},R,U,\Omega,\sim$ satisfying (C1) and (C2). We sometimes refer to the overall ensemble also just as $\mathcal{B}$ . We define the neighborhood of $B\in\mathcal{B}$ by $N(B)=\{A\in\mathcal{B}:A\sim B\}$ and we also define $\overline{N}(B)=N(B)\cup\{B\}$ .

Observe that if $\mathcal{C},R,U,\Omega,\sim$ is a resampling-space and $\mathcal{B}\subseteq\mathcal{C}$ , then $\mathcal{B},R,U,\Omega,\sim$ is also a resampling-space (where $\sim$ is the restriction to $\mathcal{B}$ ). Furthermore, if (C3) holds for $\mathcal{C}$ then it holds for $\mathcal{B}$ as well. We emphasize that these properties alone do not imply that Algorithm 2 will converge when using the resampling oracle $\mathfrak{R}$ . Our usual strategy is to show that some generic set $\mathcal{C}$ is a resampling-space with desired properties, and then take $\mathcal{B}$ to be an arbitrary subset of $\mathcal{C}$ . We then show that one of the LLLL convergence criteria, such as Shearer’s criterion, is satisfied on $\mathcal{B}$ . See Appendix A for further details and definitions.

Bearing this in mind, we can summarize the main result of [25] as follows:

Theorem 2.2 ([25]).

If $\mathcal{B}$ is a resampling-space which satisfies Shearer’s criterion, then Algorithm 2 terminates in expected polynomial time.

We are now ready to introduce the new structural property:

(C4)

(Obliviousness) For all pairs $B,B^{\prime}$ in $\mathcal{B}$ with $B\not\sim B^{\prime}$ , and all $r\in R_{B}$ , one of the following two conditions holds:

(a)

For all $u\in B\cap B^{\prime}$ we have $ru\in B^{\prime}$ .

(b)

For all $u\in B\cap B^{\prime}$ we have $ru\notin B^{\prime}$

We refer to this as obliviousness since whether $ru$ is in $B^{\prime}$ does not depend upon the state $u$ . In light of (C4), let us define the set $R_{B;B^{\prime}}=\{r\in R_{B}\mid ru\in B^{\prime}\}$ , where the condition $ru\in B^{\prime}$ may be evaluated at any (equivalently, every) state $u\in B\cap B^{\prime}$ . We also define the conditional probability distribution $\Gamma_{B;B^{\prime}}=\Gamma_{B}|R_{B;B^{\prime}}$ , and for any set $E\subseteq\mathcal{B}$ we define $R_{B;E}=\bigcap_{B^{\prime}\in E}R_{B;B^{\prime}}$ and $\Gamma_{B;E}=\Gamma_{B}|R_{B;E}$ .
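Properties (C1), (C2) and (C4) can be checked by brute force on a small space. The sketch below does this for a toy product space of two independent bits, with one event per bit; the space, the probabilities, the encoding of monoid elements as tuples, and all function names are assumptions chosen for illustration rather than constructions from the text.

```python
from fractions import Fraction
from itertools import product

# Toy space (an assumption for illustration): two independent bits X_0, X_1.
P1 = [Fraction(1, 3), Fraction(3, 4)]   # P1[i] = Pr(X_i = 1)
U = list(product([0, 1], repeat=2))     # ground set


def omega(u):
    """Probability of state u under the product distribution."""
    prob = Fraction(1)
    for i, bit in enumerate(u):
        prob *= P1[i] if bit == 1 else 1 - P1[i]
    return prob


def act(r, u):
    """Resampling action: r[i] is None (keep X_i) or a new value for X_i."""
    return tuple(ui if ri is None else ri for ri, ui in zip(r, u))


# Two events on disjoint variables, declared non-adjacent (A is not ~ B).
A = {u for u in U if u[0] == 1}
B = {u for u in U if u[1] == 1}

# Gamma_A redraws X_0 from its marginal and leaves X_1 untouched.
gamma_A = {(b, None): (P1[0] if b == 1 else 1 - P1[0]) for b in (0, 1)}


def regenerates(event, gamma):
    """(C1): resampling a state drawn from Omega | event yields Omega."""
    mass = sum(omega(u) for u in event)
    for v in U:
        total = sum(omega(u) / mass * g
                    for u in event for r, g in gamma.items() if act(r, u) == v)
        if total != omega(v):
            return False
    return True


def local(event, other, gamma):
    """(C2): resampling `event` never makes a false non-neighbor true."""
    return all(act(r, u) not in other for u in event - other for r in gamma)


def oblivious(event, other, gamma):
    """(C4): for each r, membership of ru in `other` agrees for all u in both events."""
    both = event & other
    return all(len({act(r, u) in other for u in both}) <= 1 for r in gamma)


def r_semicolon(event, other, gamma):
    """R_{event;other}: elements r that keep `other` true (well defined under (C4))."""
    both = event & other
    return {r for r in gamma if all(act(r, u) in other for u in both)}
```

In this toy space every element of $R_{A}$ leaves $X_{1}$ untouched, so condition (a) of (C4) always applies and $R_{A;B}=R_{A}$.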

The definition of commutativity as it appears in (C3) is cumbersome to work with and lacks good compositional properties. To make it easier to show (C3), we use an additional property of resampling oracles identified by Achlioptas & Iliopoulos [1], which we refer to as injectivity. (Footnote 4: In [1], this property is referred to as atomicity. We use the alternate terminology injectivity to avoid confusion with our discussion of atomic bad-events.) We state one variant of this property as follows:

(C5)

(Injectivity) For all $u\in U$ and $B\in\mathcal{B}$ , there is exactly one $w\in B$ with $u\in R_{B}w$ .

Our main motivation for this property is that it greatly simplifies condition (C3), allowing us to use an alternate condition (C3’) instead:

(C3’)

For all pairs $B_{1},B_{2}$ and all $u\in B_{1}\cap B_{2}$ we have $R_{B_{2}}R_{B_{1};B_{2}}u=R_{B_{1}}R_{B_{2};B_{1}}u$ .

We summarize this in the following result:

Proposition 2.3.

If properties (C3’), (C4), (C5) hold, then property (C3) holds.

Proof.

We begin with a preliminary calculation: consider any $B\in\mathcal{B},w\in B,u\in R_{B}w$ . By (C1) we have $\Pr_{r\approx\Gamma_{B},w^{\prime}\approx\Omega|B}(rw^{\prime}=u)=\Omega[u]$ . By (C5), we have $rw^{\prime}=u$ only if $w^{\prime}=w$ , and so $\Pr_{r\approx\Gamma_{B},w^{\prime}\approx\Omega|B}(rw^{\prime}=u)=\Pr_{r\approx\Gamma_{B},w^{\prime}\approx\Omega|B}(rw^{\prime}=u\wedge w=w^{\prime})=\Omega[w]/\Omega[B]\times\Pr_{r\approx\Gamma_{B}}(rw=u)$ . Combining these equations, we get the following formula:

$$\Pr_{r\approx\Gamma_{B}}(rw=u)=\frac{\Omega[u]\,\Omega[B]}{\Omega[w]}\qquad(1)$$

Let us now show (C3). Fix $B_{1},B_{2},u,u^{\prime}$ . By (C5), at most one state $w$ has $w\in R_{B_{1}}u,u^{\prime}\in R_{B_{2}}w$ . If there is no such $w$ , then there is nothing to show. Otherwise, by (C3’) there must exist $w^{\prime}$ with $u^{\prime}\in R_{B_{1}}w^{\prime},w^{\prime}\in R_{B_{2}}u$ . We map $w$ to this $w^{\prime}$ . Since there is only one possible value $w$ , the mapping is trivially injective. We need to show that this pair $w,w^{\prime}$ satisfies

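$$\Pr_{r_{1}\approx\Gamma_{B_{1}}}(r_{1}u=w)\Pr_{r_{2}\approx\Gamma_{B_{2}}}(r_{2}w=u^{\prime})=\Pr_{r_{2}\approx\Gamma_{B_{2}}}(r_{2}u=w^{\prime})\Pr_{r_{1}\approx\Gamma_{B_{1}}}(r_{1}w^{\prime}=u^{\prime})$$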

By Eq. (1), we have

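$$\Pr_{r_{1}\approx\Gamma_{B_{1}}}(r_{1}u=w)\Pr_{r_{2}\approx\Gamma_{B_{2}}}(r_{2}w=u^{\prime})=\frac{\Omega[w]\,\Omega[B_{1}]}{\Omega[u]}\cdot\frac{\Omega[u^{\prime}]\,\Omega[B_{2}]}{\Omega[w]}=\frac{\Omega[u^{\prime}]\,\Omega[B_{1}]\,\Omega[B_{2}]}{\Omega[u]}$$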

A symmetric argument shows that $\Pr_{r_{2}\approx\Gamma_{B_{2}}}(r_{2}u=w^{\prime})\Pr_{r_{1}\approx\Gamma_{B_{1}}}(r_{1}w^{\prime}=u^{\prime})$ is also equal to this quantity. ∎

2.2 Atomically-generated probability spaces

Most known resampling-spaces have a nicer form: the bad-events $B$ are conjunctions of a limited class of “atomic” events. For example, for the variable-assignment LLLL, an atomic event is $X_{i}=j$ ; for the space of uniform permutations, an atomic event is $\pi(x)=y$ . The obliviousness property allows us to formalize this: we can define a resampling oracle and a simple list of axioms for the atomic events alone, and then we automatically get a resampling oracle for conjunctions of atomic events. This vastly simplifies the constructions for a number of diverse LLLL spaces.

Let $\mathcal{A},R,U,\Omega,\sim$ be an oblivious resampling-space. We say that a set $E\subseteq\mathcal{A}$ is stable if $A\not\sim A^{\prime}$ for all distinct pairs $A,A^{\prime}\in E$ , and we define $\langle E\rangle=\bigcap_{A\in E}A$ . For $A_{1},\dots,A_{k}\in\mathcal{A}$ , we also write $\langle A_{1},\dots,A_{k}\rangle$ as shorthand for $A_{1}\cap\dots\cap A_{k}=\langle\{A_{1},\dots,A_{k}\}\rangle$ .

Let us define $\overline{\mathcal{A}}$ to be the set of conjunctions of events of $\mathcal{A}$ ,

$$\overline{\mathcal{A}}=\{\langle E\rangle\mid E\subseteq\mathcal{A}\text{ is a stable set}\}$$

We will use the same ground set $U$ and monoid $R$ for $\overline{\mathcal{A}}$ . The new dependency relation $\sim$ for $\overline{\mathcal{A}}$ is defined by setting $\langle E\rangle\sim\langle E^{\prime}\rangle$ if there exist $A\in E,A^{\prime}\in E^{\prime}$ with $A\sim A^{\prime}$ .

The key to the construction is to extend the distributions $\Gamma_{A}$ for the atomic events to a probability distribution $\Gamma_{C}$ for an event $C=\langle E\rangle$ in $\overline{\mathcal{A}}$ . To do so, we select some arbitrary fixed ordering as $E=\{A_{1},\dots,A_{k}\}$ , and we then define $\Gamma_{C}$ to be the distribution over products $r=r_{k}r_{k-1}\cdots r_{2}r_{1}$ , wherein $r_{1},\dots,r_{k}$ are independent random variables and $r_{i}$ is drawn from distribution $\Gamma_{A_{i};\{A_{i+1},\dots,A_{k}\}}$ . (For $k=0$ , $r$ is the identity element of $R$ .)
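The composition rule for $\Gamma_{C}$ can be sketched generically as follows. This is a minimal illustration, assuming that the conditional distribution $\Gamma_{A_{i};\{A_{i+1},\dots,A_{k}\}}$ is sampled by rejection (which presumes that the conditioning set has positive probability); the callback names `sample_gamma`, `keeps_true` and `multiply` are hypothetical and stand for whatever the concrete space provides.

```python
import random
from typing import Callable, Sequence, TypeVar

R = TypeVar("R")


def sample_conjunction(
    atoms: Sequence[object],
    sample_gamma: Callable[[object, random.Random], R],
    keeps_true: Callable[[R, object, object], bool],
    multiply: Callable[[R, R], R],
    identity: R,
    rng: random.Random,
) -> R:
    """Draw r = r_k ... r_1 with r_i ~ Gamma_{A_i; {A_{i+1}, ..., A_k}}."""
    r = identity
    for i, atom in enumerate(atoms):
        later = atoms[i + 1:]
        while True:
            # Rejection sampling: keep r_i only if it lies in R_{A_i; later}.
            r_i = sample_gamma(atom, rng)
            if all(keeps_true(r_i, atom, other) for other in later):
                break
        r = multiply(r_i, r)  # the newest factor goes on the left
    return r


# Usage with a hypothetical variable-assignment encoding: atom i means
# "X_i = (some value)" and a monoid element is a dict {variable: new value},
# with the empty dict as identity.
rng = random.Random(0)
r = sample_conjunction(
    atoms=[0, 2, 5],
    sample_gamma=lambda i, g: {i: g.randrange(2)},
    keeps_true=lambda r_i, atom, other: True,  # atoms on distinct variables never interfere
    multiply=lambda a, b: {**b, **a},          # a is applied after b, so a's entries win
    identity={},
    rng=rng,
)
```

For spaces such as permutations the `keeps_true` test is non-trivial, and it is exactly the obliviousness property that lets it be evaluated without reference to the current state.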

Theorem 2.4.

If $\mathcal{A}$ is an oblivious resampling-space, then so is $\overline{\mathcal{A}}$ . If, in addition, $\mathcal{A}$ satisfies (C5) and (C3’), then so does $\overline{\mathcal{A}}$ ; in particular, $\overline{\mathcal{A}}$ is commutative.

The proof of Theorem 2.4 is technical, so we defer it to Appendix C. In later sections, we use it for a number of new and simpler constructions of resampling-spaces. Notably, these include Hamiltonian cycles of $K_{n}$ and perfect matchings of $K_{n}^{(s)}$ . Our construction for Hamiltonian cycles of $K_{n}$ is commutative, in contrast to a previous resampling oracle construction of Harvey & Liaw [24]. No resampling oracle of any kind was known for perfect matchings of $K_{n}^{(s)}$ for any $s>2$ .

2.3 Efficient resampling oracles

Our framework for resampling oracles, in which $\mathfrak{R}$ is derived from a monoid $R$ , may seem overly restrictive. In fact, it is without loss of generality: for an arbitrary resampling oracle in the sense of Harvey & Vondrák [25], we could simply take $R$ to be the full transformation monoid. This would be useless computationally, because writing down an element of $R$ would require exponential time.

In order to get an efficient parallel algorithm we must be able to efficiently compute on $R$ . We summarize the requirements in terms of four properties (D0)–(D3); the runtime bounds are chosen so that the resampling action does not become the computational bottleneck for the overall algorithm described later. Here the parameter $n$ measures the input length to the algorithm.

(D0)

We can sample from $\Omega$ in $O(\log^{4}n)$ time and $\operatorname{poly}(n)$ processors.

(D1)

For any $B\in\mathcal{B}$ , we can sample from $\Gamma_{B}$ in $O(\log^{3}n)$ time and $\operatorname{poly}(n)$ processors.

(D2)

For $r\in R$ and $u\in U$ , we can compute $ru$ in $O(\log^{3}n)$ time and $\operatorname{poly}(n)$ processors.

(D3)

For $r,r^{\prime}\in R$ , we can compute $rr^{\prime}$ in $O(\log^{2}n)$ time and $\operatorname{poly}(n)$ processors.

For atomically-generated probability spaces, these properties can themselves be simplified:

Proposition 2.5.

Suppose that $\mathcal{B}\subseteq\overline{\mathcal{A}}$ , such that every bad-event $B\in\mathcal{B}$ is given by $B=\langle E\rangle$ for some stable set $E\subseteq\mathcal{A}$ with $|E|\leq\operatorname{poly}(n)$ . Suppose that $\mathcal{A}$ satisfies property (D3) as well as the following property (D1’):

(D1’)

For any $A\in\mathcal{A}$ and stable set $E\subseteq\mathcal{A}$ with $A\not\sim E$ and $|E|\leq\operatorname{poly}(n)$ , we can sample from $\Gamma_{A;E}$ in $O(\log^{3}n)$ time and $\operatorname{poly}(n)$ processors.

Then $\mathcal{B}$ satisfies property (D1).

Proof.

Let $B=\langle E\rangle$ for some stable set $E=\{A_{1},\dots,A_{k}\}$ with $k\leq\operatorname{poly}(n)$ . To draw $r\approx\Gamma_{B}$ , we first use (D1’) to sample independent variables $r_{1},\dots,r_{k}$ , wherein each $r_{i}$ is drawn from $\Gamma_{A_{i};\{A_{i+1},\dots,A_{k}\}}$ . We then use (D3) to compute $r=r_{k}\cdots r_{1}$ by $\lceil\log_{2}k\rceil$ rounds of pairwise multiplications, in $O(\log k\times\log^{2}n)=O(\log^{3}n)$ time. ∎
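The logarithmic-depth product used in this proof relies only on associativity. The following sequential Python sketch simulates the rounds of pairwise multiplication and counts them; on a PRAM all products within one round would be computed concurrently. The function name and the return convention are assumptions made for this example.

```python
from typing import Callable, List, Tuple, TypeVar

R = TypeVar("R")


def tree_product(factors: List[R], multiply: Callable[[R, R], R], identity: R) -> Tuple[R, int]:
    """Return (r_k ... r_1, number of rounds) for factors = [r_1, ..., r_k].

    Each round multiplies adjacent pairs, so the number of rounds is
    ceil(log2 k) for k >= 1.
    """
    level = list(factors)
    rounds = 0
    while len(level) > 1:
        nxt = []
        for j in range(0, len(level) - 1, 2):
            # level[j] is applied first and level[j + 1] second,
            # so the later factor goes on the left of the product.
            nxt.append(multiply(level[j + 1], level[j]))
        if len(level) % 2 == 1:
            nxt.append(level[-1])  # an unpaired factor is carried to the next round
        level = nxt
        rounds += 1
    return (level[0] if level else identity), rounds


# String concatenation is associative but not commutative, so it exposes ordering bugs.
result, depth = tree_product(["1", "2", "3", "4", "5"], lambda a, b: a + b, "")
```

Because the order of the factors is preserved within every pair, the sketch does not require the monoid to be commutative.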

We say that a resampling space is amenable if it satisfies the following computational conditions:

•

It satisfies properties (C3)–(C4).

•

The monoid $R$ satisfies properties (D0)–(D3).

•

It has a BEC running in $O(\log^{3}n)$ time and $\operatorname{poly}(n)$ processors.

We will later describe a parallel algorithm for such spaces. Note that, even without these properties, the resampling-space may still be useful for a sequential algorithm or a combinatorial existence proof. Also, note that the third condition is satisfied if $m\leq\operatorname{poly}(n)$ and we can efficiently check each bad-event in $O(\log^{3}n)$ time.

2.4 Cartesian products

Another useful method for constructing resampling-spaces comes from a cartesian product construction. Consider resampling-spaces $\mathcal{C}_{i},R_{i},U_{i},\Omega_{i},\sim_{i}$ for $i=1,\dots,s$ . We define a new resampling-space $\mathcal{C}=\mathcal{C}_{1}\times\dots\times\mathcal{C}_{s}$ as follows. The underlying space is $U=U_{1}\times\dots\times U_{s}$ and $\Omega$ is the corresponding product distribution. The monoid $R$ is the cartesian product $R_{1}\times\dots\times R_{s}$ , with the natural monoid act on $U$ . The events in $\mathcal{C}$ are those of the form $C_{1}\times\dots\times C_{s}$ , where $C_{i}\in\mathcal{C}_{i}$ . For such an event $C$ , we define $\Gamma_{C}$ to be the probability distribution on tuples $(r_{1},\dots,r_{s})$ , wherein $r_{1},\dots,r_{s}$ are independent, and $r_{i}$ is drawn from $\Gamma_{C_{i}}$ in resampling-space $\mathcal{C}_{i}$ . The relation $\sim$ on $\mathcal{C}$ is defined by $(C_{1},\dots,C_{s})\sim(C^{\prime}_{1},\dots,C^{\prime}_{s})$ if there is an index $i\in\{1,\dots,s\}$ where $C_{i}\sim_{i}C^{\prime}_{i}$ .

The following is immediate from the definitions:

Observation 2.6.

If $\mathcal{C}_{1},\dots,\mathcal{C}_{s}$ are oblivious resampling-spaces, then so is $\mathcal{C}$ .

If in addition $\mathcal{C}_{1},\dots,\mathcal{C}_{s}$ are commutative, then so is $\mathcal{C}$ .

If in addition $s\leq\operatorname{poly}(n)$ and $\mathcal{C}_{1},\dots,\mathcal{C}_{s}$ satisfy properties (D0)–(D3), then so does $\mathcal{C}$ .
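The cartesian product construction is entirely componentwise, which the sketch below makes explicit. The two component monoids used in the usage lines (overwrite-unless-`None`, and addition modulo 5) are arbitrary toy monoid acts picked for illustration; they are not claimed to be resampling-spaces, and all names are assumptions.

```python
from typing import Callable, Sequence, Tuple


def product_act(acts: Sequence[Callable], r: Tuple, u: Tuple) -> Tuple:
    """Natural act of R_1 x ... x R_s on U_1 x ... x U_s: apply each r_i to u_i."""
    return tuple(a(ri, ui) for a, ri, ui in zip(acts, r, u))


def product_multiply(muls: Sequence[Callable], r: Tuple, r2: Tuple) -> Tuple:
    """Componentwise monoid product."""
    return tuple(m(a, b) for m, a, b in zip(muls, r, r2))


def product_related(rels: Sequence[Callable], C: Tuple, C2: Tuple) -> bool:
    """(C_1, ..., C_s) ~ (C'_1, ..., C'_s) iff C_i ~_i C'_i for some index i."""
    return any(rel(c, c2) for rel, c, c2 in zip(rels, C, C2))


# Two toy component monoids (illustrative only).
overwrite_act = lambda r, u: u if r is None else r
overwrite_mul = lambda r, r2: r2 if r is None else r
shift_act = lambda r, u: (r + u) % 5
shift_mul = lambda r, r2: (r + r2) % 5

acts = [overwrite_act, shift_act]
muls = [overwrite_mul, shift_mul]
```

Sampling from $\Gamma_{C}$ in the product is likewise one independent draw per coordinate, which is why the properties (D0)–(D3) carry over when $s\leq\operatorname{poly}(n)$ .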

As an example, the permutation LLL as defined in [22] allows selection of $s$ permutations $\pi_{1},\dots,\pi_{s}$ , wherein each $\pi_{i}$ is drawn independently and uniformly from some $S_{n_{i}}$ , and a bad-event has the form $\pi_{i_{1}}(x_{1})=y_{1}\wedge\dots\wedge\pi_{i_{k}}(x_{k})=y_{k}$ . This can be modeled as the cartesian product of the uniform distributions on $S_{n_{1}},\dots,S_{n_{s}}$ . Therefore, the resampling action defined by the uniform distribution on $S_{n}$ immediately gives a corresponding resampling action for the permutation LLL.

3 LFMIS for directed graphs

Before we describe the parallel LLLL algorithm, we need an important graph-theoretic subroutine: the LFMIS for directed graphs. This plays a similar role for our LLLL algorithm as the MIS does for the parallel MT algorithm. By itself, the LFMIS has little connection to the LLLL, and may be of independent combinatorial and algorithmic interest.

For an undirected graph $G$ , an independent set of $G$ is a vertex set $S$ where no two vertices in $S$ are adjacent in $G$ . A maximal independent set (MIS) has the additional property that no $T\supsetneq S$ is an independent set of $G$ . There is a trivial sequential algorithm to find an MIS of $G$ by adding vertices one-by-one to $S$ . The MIS produced by this sequential algorithm is referred to as the lexicographically first MIS (LFMIS).

With a slight abuse of terminology, we can extend the definition of LFMIS to a directed graph $G=(V,E)$ . Formally, we define the LFMIS of $G$ with respect to a permutation $\pi:[n]\rightarrow V$ to be the vertex set $I$ produced by the following sequential process:
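As a hedged illustration of such a sequential process, the Python sketch below assumes that vertices are scanned in the order given by $\pi$ and that a vertex joins $I$ unless some previously selected vertex has an edge into it. This reading matches the way edges are used in the proof of Proposition 4.4, but the function name and the adjacency-dictionary encoding are assumptions of this example.

```python
from typing import Dict, Hashable, List, Set


def lfmis_sequential(out_nbrs: Dict[Hashable, Set[Hashable]], order: List[Hashable]) -> Set[Hashable]:
    """Greedy scan: select v unless an earlier selected vertex u has an edge (u, v)."""
    selected: Set[Hashable] = set()
    blocked: Set[Hashable] = set()
    for v in order:
        if v not in blocked:
            selected.add(v)
            # Out-neighbors of a selected vertex can no longer be selected later.
            blocked |= out_nbrs.get(v, set())
    return selected
```

Under this assumption an edge pointing from a later vertex to an earlier one has no effect, since the earlier vertex has already been decided when the later one is scanned.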

An undirected graph $G$ can be viewed as a directed graph $G^{\prime}$ , where every edge $(u,v)\in G$ corresponds to two directed edges $(u,v),(v,u)\in G^{\prime}$ . The LFMIS (in the usual sense) of $G$ is then identical to the directed LFMIS of $G^{\prime}$ .

The LFMIS problem for undirected graphs is P-complete in general [9]. However, Blelloch, Fineman & Shun [6] described a simple parallel greedy algorithm to find the LFMIS of an undirected graph, when $\pi$ is chosen uniformly at random. The algorithm can also be used for directed graphs. We summarize it as follows, where we define $P^{\pi}(v)$ for a vertex $v$ to be the set of vertices $w$ with $\pi^{-1}(w)<\pi^{-1}(v)$ .
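A round-based greedy process of this kind can be simulated sequentially as in the sketch below. It is a minimal illustration under two assumptions: only edges $(w,v)$ with $w\in P^{\pi}(v)$ can block $v$ , and each round selects every residual vertex with no residual in-neighbor of that kind, then deletes the selected vertices together with the vertices they block. The function name and the edge-list encoding are hypothetical.

```python
from typing import Hashable, Iterable, List, Set, Tuple


def lfmis_rounds(
    vertices: Iterable[Hashable],
    edges: Iterable[Tuple[Hashable, Hashable]],
    order: List[Hashable],
) -> Tuple[Set[Hashable], int]:
    """Return (selected set, number of rounds) of the round-based greedy process."""
    pos = {v: i for i, v in enumerate(order)}
    # Keep only edges (u, v) with u earlier than v; others cannot block anything.
    fwd = [(u, v) for (u, v) in edges if pos[u] < pos[v]]
    residual = set(vertices)
    selected: Set[Hashable] = set()
    rounds = 0
    while residual:
        has_earlier_in_nbr = {v for (u, v) in fwd if u in residual and v in residual}
        sources = residual - has_earlier_in_nbr  # the set J of this round
        selected |= sources
        # Remove J and every vertex that a member of J points to.
        residual -= sources | {v for (u, v) in fwd if u in sources}
        rounds += 1
    return selected, rounds
```

The earliest residual vertex is always a source, so each round removes at least one vertex and the loop terminates after at most $n$ rounds.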

This can be viewed as a parallel algorithm, where each iteration of identifying the residual source nodes $J$ and adding them to $I$ , can be implemented in $O(\log n)$ time and $O(m+n)$ processors. Alternatively, it can be viewed as a distributed algorithm, where each iteration requires $O(1)$ distributed communication rounds on $G$ . We get the following main result to analyze Algorithm 6.

Theorem 3.1.

Algorithm 6 produces the LFMIS of $G$ with respect to $\pi$ . When $\pi$ is chosen uniformly at random, then Algorithm 6 terminates in $O(\log^{2}n)$ rounds whp. In particular, Algorithm 6 runs in $\tilde{O}(\log^{3}n)$ time on an EREW PRAM whp.

The analysis is very similar to the proof given in [6], which showed that the (undirected) degrees are rapidly reduced when $G$ is an undirected graph. We defer the full proof of Theorem 3.1 to Appendix D, which shows a slightly stronger result. Note that Fischer & Noever [12] later showed that Algorithm 6 terminates in $O(\log n)$ rounds whp for undirected graphs. We conjecture that it should be possible to improve our analysis and show that Algorithm 6 runs in $O(\log n)$ rounds whp on directed graphs as well.

4 A generic parallel resampling algorithm

We are now ready to describe our parallel algorithm for an amenable resampling-space. We recall that throughout, the parameter $n$ represents the description size of a configuration, such that a state $u$ is encoded in $\operatorname{poly}(n)$ bits. Correspondingly, our goal for an RNC algorithm is to achieve $\text{polylog}(n)$ runtime, $\operatorname{poly}(n)$ processors, and success probability $1-n^{-\Omega(1)}$ .

Clearly, if Algorithm 7 terminates, then all the bad-events in $\mathcal{B}$ are false on $u$ . For maximum generality, we analyze Algorithm 7 in terms of two parameters $W,\epsilon$ from the Shearer LLLL criterion; see Appendix A for a precise definition. Theorem A.2 gives a few simpler LLL criteria, including the symmetric, asymmetric, and cluster-expansion criteria. For most applications, $W\leq\operatorname{poly}(n)$ and $\epsilon\geq\Omega(1)$ . Our main result will be the following:

Theorem 4.1.

Let $\mathcal{B}$ be an amenable resampling-space. If the Shearer criterion is satisfied with parameters $\epsilon,W$ , then Algorithm 7 runs in $O(\frac{\log^{4}(n+W\epsilon)}{\epsilon})$ time and $\operatorname{poly}(n,W)$ processors whp.

For most applications, we can use a simplified corollary:

Corollary 4.2.

Let $\mathcal{B}$ be an amenable resampling-space which satisfies the symmetric LLL criterion $epd(1+\epsilon)<1$ . Then Algorithm 7 runs in $O(\frac{\log^{4}(mn)}{\epsilon})$ time and $\operatorname{poly}(m,n)$ processors whp.

Some probability spaces have convergence and distributional properties which go beyond the generic bounds such as Shearer’s criterion [18, 19, 27]. Since Algorithm 7 can be viewed as a simulation of the sequential algorithm, all such bounds apply equally to it. We will see some examples in the next section with analysis of the variable-assignment LLLL.

We now turn to proving Theorem 4.1. We assume throughout that $\mathcal{B}$ is amenable. We refer to each iteration of the main loop of Algorithm 7 (lines (3) – (8)) as a round. We use $V_{t},I_{t},\pi_{t}$ , etc to denote the quantities corresponding to round $t$ , and also define $b_{t}=|V_{t}|$ . We first observe that a single round can be implemented efficiently.

Proposition 4.3.

Each round of Algorithm 7 can be implemented using $\operatorname{poly}(b_{t},n)$ processors and $O(\log^{3}(b_{t}n))$ time whp.

Proof.

Since $\mathcal{B}$ is amenable, we can determine the set $V_{t}$ using our BEC in $O(\log^{3}n)$ time.

By (D1), we can draw the random variables $r_{B}$ in time $O(\log b_{t}+\log^{3}n)$ . In light of (C4), we can efficiently check if $r_{B}\in R_{B;B^{\prime}}$ , by computing $r_{B}u$ and testing if $r_{B}u\in B^{\prime}$ .

By Theorem 3.1, we can find $I$ in time $O(\log^{3}(b_{t}n))$ and $\operatorname{poly}(b_{t},n)$ processors whp.

To implement step (7), we use the associativity of monoid multiplication to compute the product $r_{B_{i_{s}}}\cdots r_{B_{i_{1}}}$ in $\lceil\log_{2}s\rceil$ rounds of pairwise multiplications. By (D3), each round takes $O(\log^{2}n)$ time. Noting that $s\leq b_{t}$ , this gives a total of $O(\log^{3}(b_{t}n))$ time and $\operatorname{poly}(b_{t},n)$ processors. Once this product is computed, we can use (D2) to compute $r_{B_{i_{s}}}\cdots r_{B_{i_{1}}}u$ . ∎

Thus, our main task is to show that Algorithm 7 terminates after a small number of rounds. We do so by coupling it to a sequential resampling algorithm, Algorithm 8.

By the principle of deferred decisions, there is no difference between selecting the random variable $r_{B}$ in a “preprocessed” way (as in line (4) of Algorithm 8) and selecting it in an “online” way as in Algorithm 2. Thus, line (8) of Algorithm 8 is equivalent to executing the resampling oracle $\mathfrak{R}_{B}(u)$ and so Algorithm 8 can be viewed as a version of Kolmogorov’s algorithm (Algorithm 4).

For Algorithm 8, define $\pi_{t}$ to be the chosen ordering of $V_{t}$ , i.e. the map sending $i$ to $B_{i}$ in $V_{t}$ . Also define $I^{\prime}_{t}$ to be the set of events resampled in round $t$ , i.e. the events $B_{k}$ such that $B_{k}\in A$ at iteration $k$ of line (7). The following result shows the equivalence between Algorithm 8 and Algorithm 7:

Proposition 4.4.

If the random variables $\pi,u,r$ are all fixed at the beginning of round $t$ and $I,I^{\prime}$ are the LFMIS produced for Algorithms 7 and 8 respectively for round $t$ , then $I=I^{\prime}$ .

Proof.

Let $u_{j}$ denote the state after iteration $j$ of round $t$ (and $u_{0}$ is the state at the beginning of round $t$ ). We have $V_{t}$ enumerated as $\{B_{1},B_{2},\dots,B_{k}\}$ where $\pi(B_{1})<\pi(B_{2})<\dots<\pi(B_{k})$ , and we write $r_{i}$ as shorthand for $r_{B_{i}}$ .

With this notation, observe that $B_{j}\in I$ iff there is no $i<j$ with $B_{i}\in I$ and either (a) $B_{i}\sim B_{j}$ or (b) $r_{i}\in R_{B_{i}}-R_{B_{i};B_{j}}$ . Similarly, $B_{j}\in I^{\prime}$ iff there is no $i<j$ with $B_{i}\in I^{\prime}$ and either (a) $B_{i}\sim B_{j}$ or (b) $B_{j}$ is false on $u_{i}$ . For contradiction, say that $j$ is minimal such that the membership of $B_{j}$ differs in $I$ and $I^{\prime}$ .

Suppose that $B_{j}\in I^{\prime}-I$ . Since $B_{j}\notin I$ , there must be some $i<j$ with $B_{i}\in I$ such that $B_{i}\sim B_{j}$ or $r_{i}\notin R_{B_{i};B_{j}}$ . In the former case, by our induction hypothesis $B_{i}\in I^{\prime}$ and this would contradict that $B_{j}\in I^{\prime}$ . In the latter case, note that since $B_{j}\in I^{\prime}$ , it must be that $B_{j}$ is true on $u_{i}$ and $u_{i-1}$ and $B_{i}$ is true on $u_{i-1}$ . Thus, $u_{i-1}\in B_{i}\cap B_{j}$ and $u_{i}=r_{i}u_{i-1}\in B_{j}$ . So $r_{i}\in R_{B_{i};B_{j}}$ , a contradiction.

Next, suppose that $B_{j}\in I-I^{\prime}$ . Since $B_{j}\notin I^{\prime}$ , there must be some $i<j$ with $B_{i}\in I^{\prime}$ such that $B_{i}\sim B_{j}$ or $B_{j}$ is false on $u_{i}$ . Let $i$ be minimal subject to these conditions. In the former case, by induction hypothesis $B_{i}\in I$ ; in the latter case, by minimality of $i$ , it must be that $B_{j}$ becomes false after resampling $B_{i}$ , and so $B_{i}\in I$ . In either case, we have $B_{i}\in I$ . So $u_{i-1}\in B_{i}\cap B_{j}$ and $u_{i}=r_{i}u_{i-1}\notin B_{j}$ , implying that $r_{i}\in R_{B_{i}}-R_{B_{i};B_{j}}$ . Thus $G$ has an edge $(B_{i},B_{j})$ , contradicting that $B_{j}\in I$ . ∎

The key property we need to analyze Algorithm 8 is the following:

Lemma 4.5.

If $B\in V_{t}$ for $t\geq 2$ , then $\overline{N}(B)\cap I^{\prime}_{t-1}\neq\emptyset$ .

Proof.

In the execution of Algorithm 8, let $T_{i}$ denote the total number of resamplings before round $i$ (so $T_{1}=0$ ), and note that $u^{T_{i}}$ is the state immediately at the beginning of round $i$ .

By definition, $B$ must be true on $u^{T_{t}}$ . Either $B$ is true at time $T_{t-1}$ or $B\sim B^{\prime}\in I^{\prime}_{t-1}$ ; otherwise, by property (C4), $B$ would remain false after all the resamplings in round $t-1$ .

If $B\sim B^{\prime}\in I^{\prime}_{t-1}$ or $B\in I^{\prime}_{t-1}$ we are done. Otherwise, suppose $B\in V_{t-1}-I^{\prime}_{t-1}$ . This can only be the case if $B$ was marked as dead in round $t-1$ . Suppose this occurs at time $i$ , during the resampling of some $B^{\prime}\in I^{\prime}_{t-1}$ . If $B\sim B^{\prime}$ , we are done.

Otherwise, suppose that $B$ is false on $u^{i}$ . Since $B$ is true at the beginning of round $t$ , by (C4) there must be some $B^{\prime\prime}$ resampled between times $i$ and $T_{t}$ with $B^{\prime\prime}\sim B$ , i.e. $B^{\prime\prime}\in\overline{N}(B)\cap I^{\prime}_{t-1}$ . ∎

Lemma 4.5 in combination with analysis of Kolmogorov [29] shows that Algorithm 7 terminates in a small (polylogarithmic) number of rounds. There is also a “random-like” distribution of the states during intermediate stages of the parallel LLLL algorithm. In all, we get the following bound:

Lemma 4.6.

Whp, Algorithm 7 terminates after $O(\frac{\log(n+W\epsilon)}{\epsilon})$ rounds and $\sum_{t}b_{t}\leq O(W\operatorname{poly}(n))$ .

The proof of Lemma 4.6 requires significant background and a number of preliminary definitions, so we defer it to Appendix A.

Now let $s=O(\frac{\log(n+W\epsilon)}{\epsilon})$ denote the total number of rounds in Algorithm 7. Proposition 4.3 shows that each round $t$ uses $O(\log^{3}(b_{t}n))$ time and $\operatorname{poly}(b_{t},n)$ processors whp. Property (D0) allows us to implement step (1) in $O(\log^{4}n)$ time. Thus the overall runtime of Algorithm 7 is at most $O(\log^{4}n+s\log^{3}n+\sum_{t=1}^{s}\log^{3}(b_{t}))$ . By concavity, we have $\sum_{t=1}^{s}\log^{3}b_{t}\leq s\log^{3}(1+\sum_{t=1}^{s}b_{t}/s)$ . By Lemma 4.6, we have $\sum_{t}b_{t}\leq O(W\operatorname{poly}(n))$ whp. Thus, the time complexity here is at most $O(s\log^{3}(n+W\epsilon))$ and the processor count is at most $\operatorname{poly}(n,W)$ .

This shows Theorem 4.1. Corollary 4.2 follows directly, noting that $W\leq O(m)$ .

5 The variable-assignment LLLL

The variable-assignment LLLL is one of the most important LLLL probability spaces. Let us set notation and discuss how this fits into our resampling framework. We also discuss a few unique properties of the variable-assignment LLLL as well as some applications.

To begin the construction, we first consider the simplest setting, where the probability space $\Omega$ is defined by a single variable $X$ over a universe $U$ . The generic bad-event set $\mathcal{B}$ has the tautological event $\top$ , as well as an event $B_{u}\equiv X=u$ for each $u\in U$ . We define $\sim$ by setting $B_{u}\sim B_{u^{\prime}}$ for $u\neq u^{\prime}$ . (The event $\top$ is not dependent with any others.)

We form $R$ using a construction called the find-last monoid. Formally, we define $R=U\cup\{1\}$ , where $1$ is an identity element. The binary operation on $R$ is defined as

$$rr^{\prime}=\begin{cases}r^{\prime}&\text{if }r=1\\ r&\text{if }r\neq 1\end{cases}$$

Note that $U\subseteq R$ , with $ru\in U$ for $u\in U$ , and so $R$ naturally gives a left $R$ -act on $U$ .

For event $\top$ , we define $\Gamma_{\top}$ to be the value $1$ with probability one. For an event $B_{u}$ , we define $\Gamma_{B_{u}}$ to be the distribution $\Omega$ . One can easily verify that the resulting resampling oracle $\mathfrak{R}_{B_{u}}$ is defined by $\mathfrak{R}_{B_{u}}(x)=u^{\prime}$ , where $u^{\prime}$ is drawn from the distribution $\Omega$ , i.e. we resample the variable. It is trivial to verify that this resampling-space satisfies (C3’), (C4), (C5), and (D0)–(D3).
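A minimal Python sketch of this single-variable space is given below. It assumes that the product in the find-last monoid returns its left factor unless that factor is the identity, which is what the description of $\mathfrak{R}_{B_{u}}$ as "resample the variable" requires; representing the identity by `None` (so `None` must not be a value in $U$ ) and the function names are choices made for this example.

```python
import random

IDENTITY = None  # plays the role of the identity element 1 of R = U ∪ {1}


def multiply(r, r2):
    """Product r r2 (apply r2 first, then r): r wins unless it is the identity."""
    return r2 if r is IDENTITY else r


def act(r, u):
    """Left act on U: the identity keeps u, any other element overwrites it."""
    return u if r is IDENTITY else r


def sample_gamma(event_is_top: bool, omega: dict, rng: random.Random):
    """Gamma_top is the identity; Gamma_{B_u} is a fresh draw of X from Omega."""
    if event_is_top:
        return IDENTITY
    values = list(omega)
    return rng.choices(values, weights=[omega[v] for v in values])[0]
```

With this encoding, the identity $(rr^{\prime})u=r(r^{\prime}u)$ holds for all $r,r^{\prime}\in R$ and $u\in U$ , so the act is well defined.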

We can get the full variable-assignment LLLL via the cartesian product construction. Namely, the probability space is over $U=D^{n}$ for some discrete set $D$ , and each bad-event $B$ has the form $(B_{1},\dots,B_{n})$ , wherein $B_{i}$ is either $\top$ or an event $X_{i}=j_{i}$ . Equivalently, $B$ can be written as $B\equiv X_{i_{1}}=j_{1}\wedge\dots\wedge X_{i_{k}}=j_{k}$ . For such an event, we define $\Gamma_{B}$ as follows: For $r=(r_{1},\dots,r_{n})\approx\Gamma_{B}$ , the entries $r_{1},\dots,r_{n}$ are all independent, wherein $r_{i}=1$ for $B_{i}=\top$ and $r_{i}\approx\Omega_{i}$ otherwise. The resulting oracle $\mathfrak{R}_{B}$ is to simply resample the variables $X_{i_{1}},\dots,X_{i_{k}}$ . By Observation 2.6, this resampling-space is again amenable.

This is a very notationally heavy way of describing a very simple probability space and a very simple resampling action. However, it illustrates how our resampling framework gives a non-trivial resampling-space (the full variable-assignment LLLL) by composing a few trivial building-blocks.
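For intuition, the resulting oracle can be exercised in a simple sequential resampling loop: while some bad-event is true, redraw the variables of one such event. The sketch below is only a sequential illustration over fair binary variables, not the parallel Algorithm 7; the event encoding, the step limit and the function name are assumptions.

```python
import random
from typing import Dict, List, Optional

BadEvent = Dict[int, int]  # conjunction "X_i = value" over the keys i


def resample_until_good(
    n: int,
    bad_events: List[BadEvent],
    rng: random.Random,
    max_steps: int = 10_000,
) -> Optional[List[int]]:
    """Resample true bad-events one at a time; return an assignment avoiding all of them."""
    x = [rng.randrange(2) for _ in range(n)]  # initial state drawn from Omega
    for _ in range(max_steps):
        true_events = [b for b in bad_events if all(x[i] == v for i, v in b.items())]
        if not true_events:
            return x
        for i in true_events[0]:  # apply the oracle: redraw exactly the variables of B
            x[i] = rng.randrange(2)
    return None  # step limit reached without avoiding every bad-event


bad_events = [{0: 0, 1: 0, 2: 0}, {2: 1, 3: 1, 4: 1}, {1: 1, 4: 0, 5: 0}]
solution = resample_until_good(6, bad_events, random.Random(0))
```

The step limit is a safeguard for the example; the convergence guarantees discussed in the text require an LLLL criterion to hold for $\mathcal{B}$ .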

5.1 Alternate LLLL criterion

In [18], Harris described an alternative convergence criterion for the MT algorithm called orderability. This is defined in terms of a function $\mu:\mathcal{B}\rightarrow[0,\infty)$ ; the full formal definitions are technical and are deferred to Appendix B. As our parallel algorithm for the variable-assignment LLLL can be viewed as an implementation of the MT algorithm, the orderability criterion can also be used to analyze Algorithm 7. This gives the following result:

Theorem 5.1.

Let $\mu:\mathcal{B}\rightarrow[0,\infty)$ satisfy the orderability variable-assignment criterion with $\epsilon$ -slack, and let $W=\sum_{B\in\mathcal{B}}\mu(B)$ . If $\mathcal{B}$ has a BEC using $O(\log^{4}n)$ time and $\operatorname{poly}(n)$ processors, then Algorithm 7 runs in $O(\frac{\log^{4}(n+W\epsilon)}{\epsilon})$ time and $\operatorname{poly}(n,W)$ processors whp.

As an example application, we get the following result:

Proposition 5.2.

Suppose we have a $k$ -SAT instance in $n$ variables, where each variable appears in at most $L=\frac{2^{k+1}(1-1/k)^{k}}{(k-1)(1+\epsilon)}-\frac{2}{k}$ clauses. Then there is a parallel algorithm to find a satisfying assignment in $O(\frac{\log^{4}n}{\epsilon})$ time using $\operatorname{poly}(n)$ processors whp.

Proof.

As shown in [18, Theorem 4.1], the orderability criterion is satisfied with slack $\epsilon$ under these conditions using the weighting function $\mu(B)=\frac{1+\epsilon}{(2-2/k)^{k}}$ for all $B\in\mathcal{B}$ . Furthermore, $W\leq m\leq nk\leq n^{2}$ and we can implement a BEC by checking every clause. ∎
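The occurrence bound $L$ of Proposition 5.2 and the weighting function $\mu$ from the proof are easy to evaluate numerically. The following sketch simply transcribes the two formulas; the function names are ours.

```python
def max_occurrences(k: int, eps: float) -> float:
    """Bound L on the number of clauses containing any one variable (Proposition 5.2)."""
    return 2 ** (k + 1) * (1 - 1 / k) ** k / ((k - 1) * (1 + eps)) - 2 / k


def weight(k: int, eps: float) -> float:
    """Weighting function mu(B) = (1 + eps) / (2 - 2/k)^k used in the proof."""
    return (1 + eps) / (2 - 2 / k) ** k


# Table of the bound for a few clause widths at slack eps = 0.1.
table = {k: max_occurrences(k, 0.1) for k in (3, 5, 10, 20)}
```

Since $(1-1/k)^{k}$ tends to $1/e$ , the bound grows like $2^{k+1}/(e(k-1)(1+\epsilon))$ for large $k$ .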

5.2 Distributed algorithms

The LOCAL model is a popular model for distributed graph algorithms. Here, in each round, a node in a graph can perform arbitrary computations and has unlimited communication with its neighbors. Distributed LLL algorithms can solve a number of graph problems in this setting, where each vertex $v$ has a set of associated bad-events $\mathcal{B}_{v}$ local to $v$ , and bad-events in $\mathcal{B}_{v}$ and $\mathcal{B}_{u}$ are dependent iff the distance from $u$ to $v$ is bounded by some (problem-specific) constant.

As a simple example, consider finding a proper vertex-coloring. For each vertex $v$ , we have some bad-events that $v$ chooses the same color as a neighbor $w$ . Observe now that $\mathcal{B}_{v}$ and $\mathcal{B}_{u}$ are dependent iff there is some common vertex $w$ , i.e. $\text{dist}(u,v)\leq 2$ . See [8] for a thorough discussion of this model of computation and applications to a number of graph-coloring problems.
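In the coloring example, the vertices whose bad-events can interact with those of $v$ are the ones within two hops of $v$. A minimal sketch of computing this set, assuming the graph is given as an adjacency dictionary `adj` (a representation chosen here for illustration):

```python
def dependent_vertices(adj, v):
    """Vertices other than v that lie within two hops of v."""
    near = set(adj[v])
    for w in adj[v]:
        near.update(adj[w])
    near.discard(v)
    return near
```

In the LOCAL model this set can be gathered in two communication rounds.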

Our parallel algorithm can be easily transformed into a distributed LLLL algorithm:

Proposition 5.3.

Suppose that the orderability variable-assignment criterion is satisfied with parameters $W,\epsilon$ . Then there is a distributed LOCAL algorithm to find a variable assignment avoiding $\mathcal{B}$ in $O(\frac{\log^{3}(Wn)}{\epsilon})$ rounds whp. In particular, if $epd(1+\epsilon)\leq 1$ , then this runs in $O(\frac{\log^{3}(mn)}{\epsilon})$ rounds.

Proof.

All of the steps in a round $t$ of Algorithm 7, except the computation of the LFMIS at line (6) and the state update at line (8) can be implemented in $O(1)$ communication rounds. The state update can be done in $O(\log(b_{t}n))$ rounds and the greedy LFMIS can be implemented in $O(\log^{2}(b_{t}n))$ rounds whp; note that Algorithm 7 only creates an edge between $B,B^{\prime}$ if $B,B^{\prime}$ overlap on a variable and so we can simulate the directed graph created in line (5). As shown in Appendix B we have $b_{t}\leq W\operatorname{poly}(n)$ whp. ∎

One application, which is an immediate consequence of the LLLL analysis of [18], is to proper vertex coloring of a hypergraph:

Proposition 5.4.

Let $H$ be a $k$ -uniform hypergraph in which each vertex appears in at most $L$ edges. Then there is a randomized LOCAL algorithm in $O(\frac{\log^{3}n}{\epsilon})$ rounds to construct a non-monochromatic $c$ -coloring of $H$ for $L\leq\frac{c^{k}(1-1/k)^{k-1}}{k(c-1)(1+\epsilon)}$ .

6 Other resampling-spaces

We now discuss how our resampling framework applies to a few other resampling-spaces, with some applications. The main space discussed here is the uniform distribution on $S_{n}$ . Two others are the uniform distribution on Hamiltonian cycles of $K_{n}$ , and the uniform distribution on perfect matchings of the complete hypergraph $K_{n}^{(s)}$ for $s\geq 2$ . The latter two involve very technical algebraic arguments, so we defer the full proofs to Appendices E and F.

6.1 Uniform distribution on $S_{n}$

In this setting, we have $U=S_{n}$ , and we use $\pi$ instead of $u$ to represent the system state. The atomic sets have the form

$$A=\{\pi\in S_{n}\mid\pi x=y\}$$

for some $(x,y)\in[n]\times[n]$ ; we write this as $A=\langle(x,y)\rangle$ . We define $\sim$ on $\mathcal{A}$ by setting $\langle(x,y)\rangle\sim\langle(x^{\prime},y^{\prime})\rangle$ if one of the following two conditions holds: (i) $x=x^{\prime}$ and $y\neq y^{\prime}$ or (ii) $x\neq x^{\prime}$ and $y=y^{\prime}$ . Equivalently, we have $A\sim A^{\prime}$ iff $\Pr_{\Omega}(A\cap A^{\prime})=0$ .

We define $R$ to be the symmetric group $S_{n}$ . For any $A=\langle(x,y)\rangle$ , we define $R_{A}$ to be the set of single-swap permutations of the form $\sigma=(y\ z)$ for $z\in[n]$ , and $\Gamma_{A}$ is the uniform distribution on $R_{A}$ . We define the resampling action as left-multiplication in the obvious way.
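The resampling action for an atomic event can be sketched in a few lines of Python. The sketch assumes $[n]=\{0,\dots,n-1\}$ and stores a permutation as a tuple `pi` with `pi[x]` the image of `x`; the helper names are ours.

```python
import random

def transposition(n, y, z):
    """The swap (y z) on {0, ..., n-1}; the identity when y == z."""
    t = list(range(n))
    t[y], t[z] = t[z], t[y]
    return tuple(t)

def compose(sigma, pi):
    """Left-multiplication: (sigma pi)(x) = sigma(pi(x))."""
    return tuple(sigma[pi[x]] for x in range(len(pi)))

def resample_atom(pi, x, y, rng):
    """Resample A = <(x, y)>: draw z uniformly and return ((y z) pi, z)."""
    assert pi[x] == y, "the event must currently hold"
    z = rng.randrange(len(pi))
    return compose(transposition(len(pi), y, z), pi), z
```

After the call, the new permutation maps $x$ to $z$, since $(y\ z)$ sends $y$ to $z$.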

Proposition 6.1.

Properties (D0), (D2) and (D3) hold.

Proof.

The monoid operation and monoid action are both composition of permutations, which can easily be done in $O(\log n)$ time. Property (D0) holds using any of the standard ways to generate uniform random permutations. ∎

Proposition 6.2.

Properties (C5) and (C1) hold.

Proof.

Consider $A=\langle(x,y)\rangle$ and $\pi\in S_{n}$ . We claim that there is precisely one pair $(z,\tau)$ with $z\in[n],\tau\in A$ such that $\pi=(y\ z)\tau$ . For, we have $(y\ z)\pi\in A$ iff $(y\ z)\pi x=y$ iff $z=\pi x$ . Furthermore, once $z$ is determined, $\tau$ is also uniquely determined.

This shows (C5). Also, when $z\approx\text{Unif}[n]$ and $\tau\approx\Omega|A$ , it implies that $(y\ z)\tau=\pi$ with probability precisely $\frac{1}{n}\times\frac{1}{|A|}=\frac{1}{n!}$ . Thus $(y\ z)\tau$ is uniformly distributed on $S_{n}$ , showing (C1). ∎
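The counting argument can be checked exhaustively for small $n$. The sketch below (0-indexed, helper names ours) verifies that the map $(z,\tau)\mapsto(y\ z)\tau$ from $[n]\times A$ to $S_{n}$ is a bijection, which is the fact behind both (C5) and (C1).

```python
from itertools import permutations

def transposition(n, y, z):
    t = list(range(n))
    t[y], t[z] = t[z], t[y]
    return tuple(t)

def compose(sigma, pi):
    return tuple(sigma[pi[x]] for x in range(len(pi)))

def is_bijection_onto_sn(n, x, y):
    """Check that (z, tau) -> (y z) tau hits every permutation exactly once."""
    atom = [p for p in permutations(range(n)) if p[x] == y]
    images = [compose(transposition(n, y, z), tau)
              for z in range(n) for tau in atom]
    return (len(images) == len(set(images))
            and set(images) == set(permutations(range(n))))
```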

Proposition 6.3.

Property (C2) holds.

Proof.

Consider $A=\langle(x,y)\rangle$ and $A^{\prime}=\langle(x^{\prime},y^{\prime})\rangle$ with $A\not\sim A^{\prime}$ , and $\pi\in A-A^{\prime}$ . Clearly $A\neq A^{\prime}$ , and since $A\not\sim A^{\prime}$ this gives $x^{\prime}\neq x,y^{\prime}\neq y$ . Suppose for contradiction that $(y\ z)\pi x^{\prime}=y^{\prime}$ . So $\pi x^{\prime}=(y\ z)y^{\prime}$ . If $z\neq y^{\prime}$ , then $\pi x^{\prime}=y^{\prime}$ , which contradicts $\pi\notin A^{\prime}$ . If $z=y^{\prime}$ , then $\pi x^{\prime}=y$ , which is impossible as $\pi x=y$ and $x\neq x^{\prime}$ . ∎

Proposition 6.4.

Let $A=\langle(x,y)\rangle$ and $A^{\prime}=\langle(x^{\prime},y^{\prime})\rangle$ with $A\not\sim A^{\prime}$ . Let $\sigma=(y\ z)\in R_{A}$ and $\pi\in A\cap A^{\prime}$ . Then:

1. If $(x,y)=(x^{\prime},y^{\prime})$ , then $\sigma\pi\in A^{\prime}\Leftrightarrow z=y$ ;

2. If $(x,y)\neq(x^{\prime},y^{\prime})$ , then $\sigma\pi\in A^{\prime}\Leftrightarrow z\neq y^{\prime}$ .

Proof.

In case (1), if $z=y$ , then $\sigma\pi=\pi$ , which is in $A=A^{\prime}$ by hypothesis. If $z\neq y$ , then $\sigma\pi x=(y\ z)y=z\neq y$ , and so $\sigma\pi\not\in A$ .

In case (2), since $A\not\sim A^{\prime}$ we have $y\neq y^{\prime}$ . If $z\neq y^{\prime}$ , then $\sigma\pi x^{\prime}=(y\ z)y^{\prime}=y^{\prime}$ and so $\sigma\pi\in A^{\prime}$ . If $z=y^{\prime}$ , then $\sigma\pi x^{\prime}=(y\ y^{\prime})y^{\prime}=y\neq y^{\prime}$ , and so $\sigma\pi\notin A^{\prime}$ . ∎
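Propositions 6.3 and 6.4 are finite statements for each fixed $n$, so they can be confirmed by brute force. The following sketch (0-indexed, names ours) runs over all pairs of atomic events with $A\not\sim A^{\prime}$, all $\pi\in A$ and all $z$, and compares the outcome of the swap with the stated conditions.

```python
from itertools import permutations

def transposition(n, y, z):
    t = list(range(n))
    t[y], t[z] = t[z], t[y]
    return tuple(t)

def compose(sigma, pi):
    return tuple(sigma[pi[x]] for x in range(len(pi)))

def related(a, b):
    """a ~ b iff exactly one of the two coordinates agrees."""
    return (a[0] == b[0]) != (a[1] == b[1])

def check_props_6_3_and_6_4(n):
    atoms = [(x, y) for x in range(n) for y in range(n)]
    for a in atoms:
        for b in atoms:
            if related(a, b):
                continue
            (x, y), (x2, y2) = a, b
            for pi in permutations(range(n)):
                if pi[x] != y:
                    continue
                for z in range(n):
                    new = compose(transposition(n, y, z), pi)
                    in_b_after = new[x2] == y2
                    if pi[x2] == y2:
                        # pi lies in both events: Proposition 6.4 applies
                        predicted = (z == y) if a == b else (z != y2)
                        if in_b_after != predicted:
                            return False
                    elif in_b_after:
                        # pi was outside A' but the swap caused it: (C2) fails
                        return False
    return True
```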

Proposition 6.5.

Property (C4) holds.

Proof.

Proposition 6.4 gives an explicit condition for when $\sigma\pi\in A^{\prime}$ holds for $A\not\sim A^{\prime},\pi\in A\cap A^{\prime},\sigma\in R_{A}$ . This condition depends solely on $A,A^{\prime},\sigma$ and not on $\pi$ itself; thus, for any fixed $\sigma$ , it holds for all such $\pi$ or none of them. ∎

Proposition 6.6.

Property (C3’) holds.

Proof.

Let $A_{1}=\langle(x_{1},y_{1})\rangle$ and $A_{2}=\langle(x_{2},y_{2})\rangle$ where $A_{1}\not\sim A_{2}$ . We need to show for any fixed $\pi$ and indices $z_{1}\in[n]-\{y_{2}\},z_{2}\in[n]$ , there exist $z_{1}^{\prime}\in[n],z_{2}^{\prime}\in[n]-\{y_{1}\}$ such that

$$(y_{2}\ z_{2})(y_{1}\ z_{1})\pi=(y_{1}\ z_{1}^{\prime})(y_{2}\ z_{2}^{\prime})\pi$$

If $A_{1}=A_{2}$ this is trivial. Also, if $z_{1},z_{2}$ are distinct from each other and $y_{1},y_{2}$ , then we can simply take $z^{\prime}_{1}=z_{1},z_{2}^{\prime}=z_{2}$ . Otherwise, there are a number of cases depending on which of the terms $z_{1},z_{2},y_{1},y_{2}$ are equal to each other.

Case I: $\boldsymbol{z_{1}=z_{2}}$ . Let $z=z_{1}=z_{2}$ . If $z=y_{1}$ , then $(y_{2}\ z_{2})(y_{1}\ z_{1})=(y_{2}\ y_{1})(y_{1}\ y_{1})=(y_{1}\ y_{2})(y_{2}\ y_{2})$ , and so setting $z^{\prime}_{1}=y_{2},z_{2}^{\prime}=y_{2}$ works. Otherwise, if $z\neq y_{1}$ , then $(y_{2}\ z_{2})(y_{1}\ z_{1})=(y_{1}\ y_{2}\ z)=(y_{1}\ y_{2})(y_{2}\ z)$ . So setting $z^{\prime}_{2}=z,z^{\prime}_{1}=y_{2}$ works. Our hypothesis $z\neq y_{1}$ ensures that $z^{\prime}_{2}\neq y_{1}$ .

Case II: $\boldsymbol{z_{2}=y_{2}}$ . Then $(y_{2}\ z_{2})(y_{1}\ z_{1})=(y_{1}\ z_{1})=(y_{1}\ z_{1})(y_{2}\ y_{2})$ , so take $z^{\prime}_{1}=z_{1},z^{\prime}_{2}=y_{2}$ . Note that $z^{\prime}_{2}\neq y_{1}$ since $y_{1}\neq y_{2}$ .

Case III: $\boldsymbol{z_{2}=y_{1}}$ . We may assume that $z_{2}\notin\{z_{1},y_{2}\}$ , as we have already covered these cases. Then $(y_{2}\ z_{2})(y_{1}\ z_{1})=(y_{1}\ z_{1}\ y_{2})=(y_{1}\ z_{1})(y_{2}\ z_{1})$ , so taking $z^{\prime}_{2}=z^{\prime}_{1}=z_{1}$ works. Note that $z_{2}^{\prime}\neq y_{1}$ , as otherwise we would have $z_{1}=z_{2}$ .

Case IV: $\boldsymbol{z_{1}=y_{1}}$ . We may assume that $z_{2}\neq y_{1}$ , as we have already covered this case. Then $(y_{2}\ z_{2})(y_{1}\ z_{1})=(y_{2}\ z_{2})=(y_{1}\ y_{1})(y_{2}\ z_{2})$ , so take $z^{\prime}_{1}=y_{1},z^{\prime}_{2}=z_{2}$ . ∎
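The existence claim behind (C3') can also be confirmed by exhaustive search for small $n$, independently of the case analysis. The sketch below (0-indexed, names ours) looks for $z_{1}^{\prime},z_{2}^{\prime}$ with $z_{2}^{\prime}\neq y_{1}$ such that the two products of swaps agree as permutations, which is sufficient for them to agree on every $\pi$.

```python
def transposition(n, y, z):
    t = list(range(n))
    t[y], t[z] = t[z], t[y]
    return tuple(t)

def compose(sigma, pi):
    return tuple(sigma[pi[x]] for x in range(len(pi)))

def find_swap(n, y1, z1, y2, z2):
    """Search for (z1p, z2p), z2p != y1, with
    (y2 z2)(y1 z1) == (y1 z1p)(y2 z2p); return None if there is none."""
    target = compose(transposition(n, y2, z2), transposition(n, y1, z1))
    for z1p in range(n):
        for z2p in range(n):
            if z2p == y1:
                continue
            candidate = compose(transposition(n, y1, z1p),
                                transposition(n, y2, z2p))
            if candidate == target:
                return z1p, z2p
    return None
```

The search is meant to be run over all $y_{1}\neq y_{2}$ , all $z_{1}\neq y_{2}$ and all $z_{2}$ ; for distinct events with $A_{1}\not\sim A_{2}$ these are exactly the inputs allowed by the statement.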

6.2 Applications

We illustrate with the classic applications of the permutation LLL to Latin transversals. Suppose we have an $n\times n$ matrix $A$ , whose entries come from some set of colors. An $s$ -bounded transversal of this matrix is a permutation $\pi\in S_{n}$ , such that no color appears at least $s$ times among the entries $A(i,\pi(i))$ . The case $s=2$ is known as a Latin transversal, and in this case the permutation is said to be rainbow in that no color is repeated among the entries of $A(i,\pi(i))$ .
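The definition translates directly into a checker. In this sketch the matrix is a list of rows and $\pi$ is a 0-indexed tuple; both representations are our choice.

```python
from collections import Counter

def is_s_bounded_transversal(matrix, pi, s):
    """True iff no color occurs s or more times among matrix[i][pi[i]]."""
    counts = Counter(matrix[i][pi[i]] for i in range(len(pi)))
    return all(c < s for c in counts.values())
```

Calling it with `s=2` tests whether `pi` is a Latin transversal.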

Proposition 6.7.

Suppose that each color appears at most $\Delta$ times in $A$ . Then, we can find a Latin transversal $\pi\in S_{n}$ in $O(\log^{4}n)$ time and $\operatorname{poly}(n)$ processors for $\Delta\leq 0.105n$ . We can find an $s$ -bounded transversal $\pi\in S_{n}$ in $O(\frac{\log^{4}n}{\epsilon})$ time and $\operatorname{poly}(n)$ processors for $\Delta\leq n\Bigl{(}\frac{(s-1)!}{2e(1+\epsilon)s}\Bigr{)}^{1/(s-1)}$ .

Proof.

We use the probability space of the uniform distribution over $S_{n}$ . For the first result, observe that the cluster-expansion LLL criterion is satisfied with slack of $\epsilon=\Omega(1)$ and $W\leq\operatorname{poly}(n)$ .

For the second result, for each tuple $t=\{(i_{1},j_{1}),\dots,(i_{s},j_{s})\}$ with $A(i_{1},j_{1})=\dots=A(i_{s},j_{s})$ , we have a separate bad-event $B_{t}$ , that $\pi(i_{1})=j_{1}\wedge\dots\wedge\pi(i_{s})=j_{s}$ . Each $B_{t}$ has probability $p\leq\frac{(n-s)!}{n!}$ , and has at most $d=2sn\binom{\Delta-1}{s-1}$ neighboring bad-events $B_{t^{\prime}}$ . Thus, in order to satisfy the symmetric LLL criterion with $\epsilon$ -slack, we need

$$e(1+\epsilon)\cdot\frac{(n-s)!}{n!}\cdot 2sn\binom{\Delta-1}{s-1}\leq 1.$$

To show this, we calculate:

[TABLE]

So $epd(1+\epsilon)\leq 1$ holds under the stated hypothesis. One can easily construct a BEC in $O(\log n)$ time: for each color class, simply enumerate all of the current entries of $\pi$ with that color. ∎
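The symmetric criterion for $s$-bounded transversals can be checked numerically from the values of $p$ and $d$ in the proof. One way to see why the hypothesis suffices is that $pd=\frac{2s}{(s-1)!}\prod_{i=1}^{s-1}\frac{\Delta-i}{n-i}$ and each factor is at most $\Delta/n$ when $\Delta\leq n$ ; this remark and the sketch below are our own illustration, with function names chosen here.

```python
import math

def max_delta(n, s, eps):
    """Largest integer Delta allowed by the hypothesis of Proposition 6.7."""
    c = math.factorial(s - 1) / (2 * math.e * (1 + eps) * s)
    return math.floor(n * c ** (1 / (s - 1)))

def symmetric_lhs(n, s, delta, eps):
    """Value of e * p * d * (1 + eps) with p, d as in the proof."""
    p = 1 / math.perm(n, s)                       # (n-s)! / n!
    d = 2 * s * n * math.comb(delta - 1, s - 1)   # neighbor bound
    return math.e * p * d * (1 + eps)
```

Evaluating `symmetric_lhs(n, s, max_delta(n, s, eps), eps)` gives a value of at most 1 whenever the returned $\Delta$ is at least 1.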

We note that the runtime in Proposition 6.7 does not depend on $s$ . By contrast, the permutation LLL algorithm of [22] would only give a parallel algorithm for constant $s$ . There are two main reasons it has poor scaling as a function of $s$ : first, the number of bad-events could be $n^{s}$ , which is super-polynomial for unbounded $s$ ; second, each bad-event spans $s$ entries, whereas [22] only allows bad-events to use polylogarithmic entries. We also note that a sequential algorithm of [22] based on partial resampling can achieve better bounds for large $s$ , but our parallelization strategy does not extend to that case.

We next illustrate with some applications to finding rainbow subgraphs of $K_{n}$ and $K_{n}^{(s)}$ :

Proposition 6.8.

Consider an edge-coloring of $K_{n}$ where every color appears on at most $\Delta$ edges. If $\Delta\leq 0.105n$ and $n$ is even, then we can find a rainbow perfect matching in $O(\log^{4}n)$ time and $\text{poly}(n)$ processors whp. If $\Delta\leq 0.026n$ , then we can find a rainbow Hamiltonian cycle in $O(\log^{4}n)$ time and $\text{poly}(n)$ processors whp.

Proof.

We encode these problems via the probability spaces of the uniform distribution of perfect matchings of $K_{n}$ and Hamiltonian cycles of $K_{n}$ , respectively. In Appendices E and F we show that the spaces both have amenable resampling oracles. It is shown in [29] and [24], respectively, that the cluster-expansion LLL criterion is satisfied with slack $\epsilon=\Omega(1)$ and $W\leq\text{poly}(n)$ . ∎

Proposition 6.9.

Consider an edge-coloring of $K_{n}^{(s)}$ where every color appears on at most $\Delta$ edges. If $\Delta\leq\frac{\binom{n-s-1}{s-1}(1-\frac{1}{2s})^{2s}}{2s-1}n$ , then there is a poly-time algorithm to find a rainbow perfect matching.

Proof.

The probability space $\Omega$ is defined by selecting a perfect matching $M$ uniformly at random. For each pair of edges $e,e^{\prime}$ of the same color, we have a bad-event $B$ that $e,e^{\prime}$ are both in $M$ . This event has probability

[TABLE]

In Appendix F, we show that $\Omega$ has a resampling-space, albeit not a commutative one. To apply the cluster-expansion criterion, we use a slightly denser dependency graph: two events $B,B^{\prime}$ are dependent if the corresponding edges overlap. To enumerate the stable sets of neighbors of $B$ with respect to this dependency graph, for each of the $2s$ vertices $j$ involved in $B$ we may select another edge $f_{j}\ni j$ and another edge $f^{\prime}_{j}$ of the same color as $f_{j}$ (a total of $\binom{n-1}{s-1}\times(\Delta-1)$ choices).

We set $\mu(B)=\alpha$ for every bad-event for some parameter $\alpha\geq 0$ . In order to satisfy the cluster-expansion criterion, we will then need

[TABLE]

Simple calculus shows that when the hypotheses are satisfied, then Eq. (2) can be satisfied for some $\alpha\geq 0$ . Using the resampling oracle in Appendix F, we can implement Algorithm 2 in polynomial time to produce a configuration avoiding $\mathcal{B}$ . ∎
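As a sanity check on the bad-event probability in the proof of Proposition 6.9, one can enumerate all perfect matchings of $K_{n}^{(s)}$ for small $n$. The sketch below does this by brute force. The closed form $1/\bigl(\binom{n-1}{s-1}\binom{n-s-1}{s-1}\bigr)$ for two disjoint edges, which it can be compared against, is our own elementary calculation and should be read as an assumption rather than a quotation of the paper's formula.

```python
from fractions import Fraction
from itertools import combinations

def perfect_matchings(vertices, s):
    """Yield every perfect matching of the complete s-uniform hypergraph
    on the sorted list `vertices`, as a list of sorted tuples."""
    if not vertices:
        yield []
        return
    first, rest = vertices[0], vertices[1:]
    for others in combinations(rest, s - 1):
        edge = (first,) + others
        remaining = [v for v in rest if v not in others]
        for m in perfect_matchings(remaining, s):
            yield [edge] + m

def pair_probability(n, s, e, e2):
    """Exact probability that both edges lie in a uniform perfect matching."""
    total = hits = 0
    for m in perfect_matchings(list(range(n)), s):
        total += 1
        hits += (e in m) and (e2 in m)
    return Fraction(hits, total)
```

Edges must be passed as sorted tuples of vertices in `range(n)`, and `n` must be divisible by `s`.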

As another application, consider strong coloring: given a graph $G$ with a partition of the vertices into $k$ blocks each of size $b$ (i.e., $V=V_{1}\sqcup\dots\sqcup V_{k}$ ), we would like to find a proper $b$ -coloring such that every block has exactly $b$ colors. In [26], Haxell showed that such a coloring exists when $b\geq c\Delta$ and $\Delta$ is sufficiently large, for some constant $c\leq 11/4$ ; this is the best bound currently known. Furthermore, the constant $11/4$ cannot be improved to any number strictly less than $2$ . In [22], a variety of LLL-based algorithms are given for constructing the colorings, with worse bounds on $b$ and with large (unspecified) runtimes. Our LLLL algorithm gives a crisp result, which is perhaps the first parallel algorithm with reasonable bounds on both $b$ and the run-time:

Proposition 6.10.

Given a partition of $G$ into blocks of size $b\geq(\frac{256}{27}+\epsilon)\Delta$ , a coloring of $G$ can be found in $O(\frac{\log^{4}n}{\epsilon})$ time whp.

Proof.

Consider the probability space of uniform distribution over permutations $\pi_{1},\dots,\pi_{k}$ , wherein each $\pi_{i}$ is a permutation of the vertices in block $V_{i}$ . For each edge $e=(v,v^{\prime})$ with $v\in V_{i},v^{\prime}\in V_{i^{\prime}}$ , and each value $\ell=1,\dots,b$ , we have a bad-event $\pi_{i}(v)=\ell\wedge\pi_{i^{\prime}}(v^{\prime})=\ell$ . Harris & Srinivasan [22] show that this satisfies the LLLL cluster-expansion criterion with $\epsilon$ -slack when $b\geq(\frac{256}{27}+\epsilon)\Delta$ . Furthermore, the probability space is the cartesian product of $k$ copies of the uniform distribution on $S_{b}$ . By Observation 2.6, this has an amenable resampling-space. ∎
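The encoding of a strong coloring by one permutation per block can be illustrated as follows. In this sketch, `blocks[i]` lists the vertices of $V_{i}$ and `perms[i][t]` is the color given to the vertex `blocks[i][t]`; this data layout is our assumption.

```python
def is_strong_coloring(edges, blocks, perms):
    """True iff no edge joins two vertices of the same color.

    Because each perms[i] is a permutation of the b colors, every block
    automatically receives each color exactly once.
    """
    color = {}
    for block, perm in zip(blocks, perms):
        assert sorted(perm) == list(range(len(block)))
        for v, c in zip(block, perm):
            color[v] = c
    return all(color[u] != color[v] for u, v in edges)
```

An edge whose two endpoints share a color corresponds to one of the bad-events $\pi_{i}(v)=\ell\wedge\pi_{i^{\prime}}(v^{\prime})=\ell$ being true.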

We note that, subsequent to the original version of this paper, a variety of works have appeared with better bounds on the colors and the runtime for strong coloring [14, 15, 20]. Most recently, [20] provides a deterministic sequential poly-time algorithm for $b\geq(3+\epsilon)\Delta$ and a deterministic parallel algorithm with $O(\log^{3}n)$ runtime for $b\geq(5+\epsilon)\Delta$ , for any constant $\epsilon>0$ .

Finally, we consider a hypergraph packing problem of Lu & Székely [31].

Proposition 6.11.

Let $H_{1},H_{2}$ be two $s$ -uniform hypergraphs on $n$ vertices, where each $H_{i}$ has $m_{i}$ edges such that $(d_{1}+1)m_{2}+(d_{2}+1)m_{1}<\frac{\binom{n}{s}}{e(1+\epsilon)}$ .

There is an algorithm in $\operatorname{poly}(n)$ processors and $\tilde{O}(\frac{\log^{4}n}{\epsilon})$ time to find an injective map $\phi:V(H_{2})\rightarrow V(H_{1})$ such that $\phi(H_{2})$ is edge-disjoint to $H_{1}$ . (That is, there are no edges $f_{1}\in H_{1},f_{2}\in H_{2}$ with $f_{1}=\{\phi(v)\mid v\in f_{2}\}$ .)

Proof.

Let us briefly review a construction of [31]. We use the LLL to construct the permutation $\phi$ . For each pair of edges $f_{1}=\{u_{1},\dots,u_{s}\}\in E(H_{1}),f_{2}=\{v_{1},\dots,v_{s}\}\in E(H_{2})$ , and each permutation $\sigma\in S_{s}$ , we form a bad-event that $\phi(v_{1})=u_{\sigma 1}\wedge\dots\wedge\phi(v_{s})=u_{\sigma s}$ . The stated hypothesis ensures that these events satisfy the symmetric LLL criterion. Furthermore, there is a simple BEC here which can be implemented in $O(\log n)$ time: for each $f_{2}$ , we sort $\phi(f_{2})$ and check if it is in $H_{1}$ . ∎
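The bad-event check described at the end of the proof can be sketched sequentially as follows. This is an illustration of the sort-and-look-up idea only, not of the parallel implementation; the function name and the list-based representation of `phi` are ours.

```python
def violated_edges(h1_edges, h2_edges, phi):
    """Edges f2 of H2 whose image under phi coincides with an edge of H1."""
    h1 = {tuple(sorted(f)) for f in h1_edges}
    return [f2 for f2 in h2_edges
            if tuple(sorted(phi[v] for v in f2)) in h1]
```

An empty result means that $\phi(H_{2})$ is edge-disjoint from $H_{1}$ .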

Note that Harris & Srinivasan [22] only give an RNC algorithm if the hypergraphs $H_{i}$ have rank $\text{polylog}(n)$ ; this condition is not required for Proposition 6.11.

7 Acknowledgments

Thanks to Chen Meiri for explanations about group actions. Thanks to anonymous conference and journal reviewers for helpful suggestions and corrections.

Appendix A Background on the LLLL

Consider some resampling-space $\mathcal{B}$ with a lopsidependency relation $\sim$ . The simplest criterion for the LLL or the LLLL on $\mathcal{B}$ is the symmetric criterion $epd\leq 1$ , where $p$ is the maximum probability of any bad-event and $d$ is the maximum dependency of any bad-event. A number of other criteria such as the asymmetric criterion can also be stated in terms of the probabilities and dependency-structure of the bad-events; the most general of these is Shearer’s criterion [37]. Parallel algorithms usually need a slightly stronger criterion which we refer to as $\epsilon$ -slack: the vector of probabilities $(1+\epsilon)\Pr_{\Omega}(B)$ must satisfy Shearer’s criterion for $\epsilon>0$ .

We will describe the Shearer criterion in terms of stable-set sequences, which is a more useful tool for analyzing the MT algorithms. The connection between stable-set sequences and the original form of Shearer’s criterion was developed by Kolipaka & Szegedy [28].

We say that a set $J\subseteq\mathcal{B}$ is stable if there are not distinct elements $B,B^{\prime}\in J$ with $B\sim B^{\prime}$ . For a stable set $J$ , we define $\overline{N}(J)=\bigcup_{B\in J}\overline{N}(B)$ .

We define a stable-set sequence to be a sequence $S=(S_{1},S_{2},\dots,S_{\ell})$ , where each $S_{i}$ is a non-empty stable set of $\mathcal{B}$ and $S_{i}\subseteq\overline{N}(S_{i+1})$ for $i=1,\dots,\ell-1$ . We say that $S$ is singleton and rooted at $B$ if $S_{\ell}=\{B\}$ . We define the depth of $S$ to be $\ell$ , the size of $S$ to be $|S|=\sum_{i=1}^{\ell}|S_{i}|$ and the weight of $S$ to be $w(S)=\prod_{i=1}^{\ell}\prod_{B\in S_{i}}\Pr_{\Omega}(B)$ . We define $\mathfrak{S}$ to be the set of all singleton stable-set sequences.
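These definitions can be made concrete with a short sketch. It assumes that $\overline{N}(B)$ denotes the inclusive neighborhood $\{B\}\cup\{B^{\prime}:B^{\prime}\sim B\}$ , that the relation $\sim$ is supplied as a symmetric predicate `related`, and that probabilities are given in a dictionary `prob`; all names are ours.

```python
from math import prod

def is_stable(J, related):
    """No two distinct elements of J are related."""
    J = list(J)
    return all(not related(a, b) for i, a in enumerate(J) for b in J[i + 1:])

def closed_neighborhood(J, events, related):
    """Union of the inclusive neighborhoods of the elements of J."""
    return {b for b in events if any(b == a or related(a, b) for a in J)}

def is_stable_set_sequence(seq, events, related):
    """Each S_i is non-empty and stable, and S_i lies in the closed
    neighborhood of S_{i+1}."""
    if any(len(S) == 0 or not is_stable(S, related) for S in seq):
        return False
    return all(set(seq[i]) <= closed_neighborhood(seq[i + 1], events, related)
               for i in range(len(seq) - 1))

def size_and_weight(seq, prob):
    """Return (|S|, w(S)) for a sequence S = (S_1, ..., S_l)."""
    size = sum(len(S) for S in seq)
    weight = prod(prob[b] for S in seq for b in S)
    return size, weight
```

A sequence is singleton and rooted at $B$ when its last entry is the one-element set containing $B$ .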

Theorem A.1 ([28]).

If Shearer’s criterion is satisfied with $\epsilon$ -slack, then $\sum_{S\in\mathfrak{S}}(1+\epsilon)^{|S|}w(S)<\infty$ .

In light of Theorem A.1, we define the key parameter $W=\sum_{S\in\mathfrak{S}}(1+\epsilon)^{|S|}w(S)$ . This allows us to state the most general bounds. However, Shearer’s criterion is difficult to work with in practice, so a number of simpler LLL criteria are often used instead.

Theorem A.2.

1. (Asymmetric LLL criterion) Suppose that some function $x:\mathcal{B}\rightarrow[0,1]$ satisfies

[TABLE]

Then Shearer’s criterion is satisfied with $\epsilon$ -slack and $W\leq\sum_{B\in\mathcal{B}}\frac{x(B)}{1-x(B)}$ .

2. (Cluster-expansion criterion [5]) Suppose that some function $\mu:\mathcal{B}\rightarrow[0,\infty)$ satisfies

[TABLE]

Then Shearer’s criterion is satisfied with $\epsilon$ -slack and $W\leq\sum_{B\in\mathcal{B}}\mu(B)$ .

3. (Symmetric LLL criterion) Suppose that $\Pr_{\Omega}(B)\leq p$ and $|\overline{N}(B)|\leq d$ for every $B\in\mathcal{B}$ , and $epd(1+\epsilon)\leq 1$ . Then Shearer’s criterion is satisfied with $\epsilon$ -slack and $W\leq emp$ .

For each bad-event $B\in V_{i}$ during Algorithm 8, we define a corresponding sequence $\hat{S}(B,i)=(S_{1},\dots,S_{i})$ by setting $S_{i}=\{B\}$ and then, for $j=i-1,\dots,1$ , setting $S_{j}=I^{\prime}_{j}\cap\overline{N}(S_{j+1})$ .

Proposition A.3.

For $B\in V_{i}$ , the sequence $\hat{S}(B,i)$ is a stable-set sequence of depth $i$ rooted at $B$ .

Proof.

Clearly $\hat{S}(B,i)$ has depth $i$ and $S_{i}=\{B\}$ , and also clearly $S_{j}\subseteq\overline{N}(S_{j+1})$ . Since $I^{\prime}_{j}$ is stable, so is each $S_{j}$ . Finally, to show that $S_{j}$ is non-empty, consider some $A\in S_{j+1}$ ; note that Lemma 4.5 ensures that there is some $A^{\prime}\in\overline{N}(A)\cap I^{\prime}_{j-1}$ ; this $A^{\prime}$ will appear in $S_{j}$ . ∎

We say that a given depth- $i$ stable-set sequence $S$ rooted at $B$ appears if $\hat{S}(B,i)=S$ . Iliopoulos [27] showed a connection between appearing stable-set sequences and probabilities of bad-events in Algorithm 2 for a commutative resampling oracle. These bounds also apply to Algorithm 8 since it is a version of Algorithm 2. We summarize the key result as follows:

Proposition A.4 ([27]).

For a commutative resampling oracle, any stable-set sequence $S$ appears with probability at most $w(S)$ .

Using our bounds on stable-set sequences and arguments from [16], we now prove Lemma 4.6:

Proof of Lemma 4.6.

Each $B\in V_{i}$ corresponds to an appearing depth- $i$ stable-set sequence $\hat{S}(B,i)$ . All such stable-set sequences are distinct: if $i\neq i^{\prime}$ , then the depths of $\hat{S}(B,i)$ and $\hat{S}(B,i^{\prime})$ are distinct, while if $B\neq B^{\prime}$ then the roots of $\hat{S}(B,i)$ and $\hat{S}(B^{\prime},i)$ are distinct.

Thus, $\sum_{i}|V_{i}|$ is at most the number of appearing stable-set sequences. Proposition A.4 shows that $\mathbf{E}[\sum_{i}|V_{i}|]\leq\sum_{S\in\mathfrak{S}}w(S)\leq W$ . So by Markov’s inequality, $\sum_{i}|V_{i}|\leq\text{poly}(n)W$ whp.

If Algorithm 8 runs for $t$ rounds, then for each $i=1,\dots,t$ , there is at least one appearing depth- $i$ stable-set sequence (namely $\hat{S}(B,i)$ for an arbitrary $B\in V_{i}$ ). Thus, a necessary condition for Algorithm 8 to run for $t$ rounds is that at least $t/2$ distinct singleton stable-set sequences of size at least $t/2$ appear. By Proposition A.4, the expected number of such sequences is given by

[TABLE]

By Markov’s inequality, the probability that the actual number exceeds $t/2$ is at most $\frac{(1+\epsilon)^{-t/2}W}{t/2}$ . This is below $n^{-\Omega(1)}$ for some $t=\Theta(\log(n+W\epsilon)/\epsilon)$ . ∎
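To see how the round bound behaves for concrete parameters, the Markov tail $\frac{(1+\epsilon)^{-t/2}W}{t/2}$ can be evaluated directly. The sketch below finds the smallest even $t$ for which it drops below $n^{-c}$ ; the exponent `c` and the function names are illustrative choices, not quantities fixed by the proof.

```python
def markov_tail(t, W, eps):
    """The bound (1 + eps)^(-t/2) * W / (t/2) from the proof."""
    return (1 + eps) ** (-t / 2) * W / (t / 2)

def rounds_needed(n, W, eps, c=1.0):
    """Smallest even t >= 2 with markov_tail(t, W, eps) <= n^(-c)."""
    t = 2
    while markov_tail(t, W, eps) > n ** (-c):
        t += 2
    return t
```

Since the tail is decreasing in $t$ , the loop terminates, and the returned value grows roughly like $\log(nW)/\epsilon$ for small $\epsilon$ .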

Appendix B Alternative variable-assignment LLLL criterion

We summarize here an alternate criterion of Harris for the variable-assignment LLLL [18].

Given a bad-event $B$ of the variable-assignment LLLL and a set $E\subseteq\overline{N}(B)$ , we say that $E$ is orderable to $B$ if either $E=\{B\}$ , or there is an ordering $B\equiv X_{i_{1}}=j_{1}\wedge\dots\wedge X_{i_{k}}=j_{k}$ and an ordering $E=\{B_{1},\dots,B_{k^{\prime}}\}$ such that, for each $\ell=1,\dots,k^{\prime}$ , the bad-event $B_{\ell}$ demands $X_{i_{\ell}}\neq j_{\ell}$ and none of the events $B_{\ell^{\prime}}$ for $\ell^{\prime}<\ell$ do so. We also say that a map $\mu:\mathcal{B}\rightarrow[0,\infty)$ satisfies the orderability criterion with $\epsilon$ -slack for $\mathcal{B}$ if it satisfies

[TABLE]

The main result of [18] is the following:

Theorem B.1.

Suppose that the map $\mu$ satisfies the orderability criterion with $\epsilon$ -slack for $\mathcal{B}$ . Then the expected number of resamplings executed by the MT algorithm is at most $\sum_{B\in\mathcal{B}}\mu(B)$ .

To show this, [18] defined a type of witness tree, which differs slightly from the witness trees in the original analysis of Moser & Tardos and from the stable-set sequences discussed in Appendix A. Let us summarize very briefly. Suppose we have run the sequential MT algorithm up to some time $T$ , resampling bad-events $B_{1},\dots,B_{T}$ , and that some event $A$ is currently true. To generate the witness tree $\hat{\tau}_{A,T}$ , we start with a root node labeled $A$ . For each $\ell=T,T-1,\dots,1$ , we try to add a node to the tree with label $B_{\ell}$ , placing it as a child of some node labeled $C$ with $C\sim B_{\ell}$ . If there are multiple eligible positions we place the node at greatest depth (breaking ties arbitrarily).

However, one additional condition is enforced: for any node $v\in\hat{\tau}_{A,T}$ with label $C$ , the children of $v$ must have distinct labels $C_{1},\dots,C_{s}$ such that $\{C_{1},\dots,C_{s}\}$ is orderable to $C$ . A node $v$ is not eligible to have a child node labeled $B$ if adding such a node would violate this condition.

We say that a labeled tree $\tau$ appears if $\hat{\tau}_{A,t}=\tau$ for some event $A$ and time $t$ . We define $\mathfrak{S}^{\prime}$ to be the set of all possible labeled trees that could appear. The key lemma of [18] is the following:

Lemma B.2.

Any labeled tree $\tau$ appears with probability at most $w(\tau)$ . Furthermore, we have $\sum\limits_{\tau\in\mathfrak{S}^{\prime}}(1+\epsilon)^{|\tau|}w(\tau)\leq W$ where we define $W=\sum_{B\in\mathcal{B}}\mu(B)$ .

Algorithm 7 can be viewed as a simulation of the sequential MT algorithm, so this same lemma applies to it. By using arguments of [18] for a similar parallel resampling algorithm, we can see that if a bad-event $B$ is true after $T$ total resamplings in the middle of round $t$ of Algorithm 8, then witness tree $\hat{\tau}_{B,T}$ has depth $t$ and is rooted at $B$ . This allows us to show a result analogous to Lemma 4.6 in terms of the orderability criterion, and thereby to show Theorem 5.1. Since the proof is nearly identical to Lemma 4.6 and Theorem 4.1, we omit it here.

Appendix C Proof of Theorem 2.4

We suppose here we have a resampling-space $\mathcal{A},R,U,\Omega,\sim$ satisfying conditions (C1), (C2), (C4). At later stages in the proof we may also assume it satisfies conditions (C3’) and (C5).

It will be convenient to work with ordered sequences from $\mathcal{A}$ . We say that $H=(A_{1},\dots,A_{k})$ is a stable list if $A_{i}\not\sim A_{j}$ for $i\neq j$ . For a permutation $\pi\in S_{k}$ , we define $\pi H=(A_{\pi 1},\dots,A_{\pi k})$ . Likewise, we define $R_{H}$ to be the set of products $h_{k}\cdots h_{1}$ wherein $h_{i}\in R_{A_{i};A_{i+1},\dots,A_{k}}$ . Whenever we discuss resampling an event $C=\langle E\rangle$ and we write $E=\{A_{1},\dots,A_{k}\}$ , then we tacitly assume that we have chosen to order the elements of $E$ as $A_{1},\dots,A_{k}$ , so that $R_{C}=R_{H}$ for the stable list $H=(A_{1},\dots,A_{k})$ .

Proposition C.1.

$\overline{\mathcal{A}}$ satisfies (C1).

Proof.

Consider $C=\langle A_{1},\dots,A_{k}\rangle$ . Let $r_{1},\dots,r_{k}$ be independent variables, wherein $r_{i}$ is drawn from $\Gamma_{A_{i};A_{i+1},\dots,A_{k}}$ . We need to show that when $u\approx\Omega|C$ , then $r_{k}\dots r_{1}u\approx\Omega$ .

For each $i=0,\dots,k$ let us define $Q_{i}=A_{i+1}\cap\dots\cap A_{k}$ and $u_{i}=r_{i}\dots r_{1}u$ . Since each $r_{i}$ is chosen from $R_{A_{i};Q_{i+1}}$ , we see that $u_{i}\in Q_{i}$ with probability one for all $i$ . We will show that $u_{i}\approx\Omega|Q_{i}$ by induction on $i$ . The base case $i=0$ is given to us by hypothesis (since $C=A_{1}\cap\dots\cap A_{k}$ ), and the case $i=k$ is what we are trying to prove.

Consider a state $\tilde{u}\approx\Omega|A_{i}$ and $\tilde{r}\approx\Gamma_{A_{i}}$ . For any $v\in U$ , property (C1) gives $\Pr(\tilde{r}\tilde{u}=v)=\Omega[v]$ . If $\tilde{r}\tilde{u}\in Q_{i+1}$ , then we claim that $\tilde{u}\in Q_{i+1}$ ; for, if $\tilde{u}\notin A_{j}$ for some $j>i$ , then by property (C2) $\tilde{r}\tilde{u}\notin A_{j}$ as well. Similarly, if $\tilde{r}\tilde{u}\in Q_{i+1}$ , then $\tilde{r}\in R_{A_{i};A_{j}}$ ; for if $\tilde{r}\notin R_{A_{i};A_{j}}$ , then by property (C4) we would have $\tilde{r}\tilde{u}\notin A_{j}$ . Thus, for $v\in Q_{i+1}$ , we have

[TABLE]

By induction hypothesis, $u_{i}$ and $\tilde{u}\mid\tilde{u}\in Q_{i+1}$ both have the distribution $\Omega|Q_{i}$ . Likewise, $r_{i}$ and $\tilde{r}\mid\tilde{r}\in R_{A_{i};Q_{i+1}}$ both have the distribution $\Gamma_{A_{i};Q_{i+1}}$ . Furthermore, the variables $\tilde{u},\tilde{r}$ are independent and the variables $u_{i},r_{i}$ are independent. This implies that

[TABLE]

So $\Omega[v]=\Pr(r_{i}u_{i}=v)\Pr(\tilde{u}\in Q_{i+1})\Pr(\tilde{r}\in R_{A_{i};\{A_{i+1},\dots,A_{k}\}})$ . This shows that $\Pr(r_{i}u_{i}=v)$ is proportional to $\Omega[v]$ for any $v\in Q_{i+1}$ . Since $r_{i}u_{i}\in Q_{i+1}$ with probability one, this implies that $u_{i+1}=r_{i}u_{i}\approx\Omega|Q_{i+1}$ . ∎

Proposition C.2.

$\overline{\mathcal{A}}$ satisfies (C2).

Proof.

Consider $C=\langle A_{1},\dots,A_{k}\rangle$ and $C^{\prime}=\langle E^{\prime}\rangle$ with $C\not\sim C^{\prime}$ , and let $u\in C-C^{\prime}$ . Consider $r=r_{k}\dots r_{1}\in R_{C}$ . There must exist some $A^{\prime}\in E^{\prime}$ such that $u\notin A^{\prime}$ . We can show that $r_{i}\dots r_{1}u\notin A^{\prime}$ for all $i$ , by an induction on $i$ : the base case $i=0$ holds since $u\notin A^{\prime}$ , and the induction step follows from property (C2) applied to event $A_{i}$ and $A^{\prime}$ .

At $i=k$ , this shows that $ru=r_{k}\cdots r_{1}u\notin A^{\prime}\supseteq C^{\prime}$ . ∎

Proposition C.3.

Let $C=\langle A_{1},\dots,A_{k}\rangle$ and $C^{\prime}=\langle A_{k+1},\dots,A_{\ell}\rangle$ be events in $\overline{\mathcal{A}}$ where $C\not\sim C^{\prime}$ . For any state $u\in C\cap C^{\prime}$ and $r\in R_{C}$ , the following are equivalent:

1. $ru\in C^{\prime}$ ;

2. There exist $r_{1},\dots,r_{k}$ such that $r=r_{k}\cdots r_{1}$ and $r_{i}\in R_{A_{i};A_{i+1},\dots,A_{k},A_{k+1},\dots,A_{\ell}}$ for all $i=1,\dots,k$ .

Proof.

For (2) $\Rightarrow$ (1), a simple induction on $i$ shows that $r_{i}\cdots r_{1}u\in C^{\prime}$ for $i=0,\dots,k$ .

For (1) $\Rightarrow$ (2), the definition of $R_{C}$ shows $r=r_{k}\dots r_{1}$ where each $r_{i}$ is in $R_{A_{i};A_{i+1},\dots,A_{k}}$ . If $r_{i}\in R_{A_{i};E^{\prime}}$ for $i=1,\dots,k$ we are done; otherwise, let $i$ be minimal such that $r_{i}\notin R_{A_{i};A_{j}}$ for some $j>k$ . So $u^{\prime}=r_{i}\dots r_{1}u\notin A_{j}$ . Since $C\not\sim C^{\prime}$ , by repeated applications of (C2), we see also that $r_{k}\dots r_{1}u=r_{k}\dots r_{i+1}u^{\prime}$ is also not in $A_{j}$ and hence not in $C^{\prime}$ . ∎

Corollary C.4.

$\overline{\mathcal{A}}$ satisfies (C4).

Proof.

For events $C,C^{\prime}$ with $C\not\sim C^{\prime}$ , Proposition C.3 gives an explicit condition on $r\in R_{C}$ to ensure that $ru\in C^{\prime}$ for $u\in C\cap C^{\prime}$ . This condition depends solely on $r$ , and not $u$ itself. ∎

Proposition C.5.

If $\mathcal{A}$ satisfies (C5), then $\overline{\mathcal{A}}$ satisfies (C5).

Proof.

Consider $C=\langle E\rangle$ for $E=\{A_{1},\dots,A_{k}\}$ . For each $i=1,\dots,k$ we define $G_{i}=R_{A_{i};A_{i+1},\dots,A_{k}}$ . For $i=k+1,\dots,1$ , we claim that there exists exactly one state $w_{i}\in A_{i}\cap\dots\cap A_{k}$ such that $u\in G_{k}\dots G_{i}w_{i}$ . The base case $i=k+1$ holds vacuously with $w_{i}=u$ , and the case $i=1$ is what we are trying to show.

For the induction step, we first show existence. By (C5), there exists $w_{i}\in A_{i}$ such that $w_{i+1}\in R_{A_{i};A_{i+1}}w_{i}$ . So $w_{i+1}=hw_{i}$ for some $h\in R_{A_{i};A_{i+1}}$ . By induction hypothesis, we have $w_{i+1}\in A_{j}$ for $j>i+1$ . Since $A_{i}\not\sim A_{j}$ , it must be the case that $w_{i}\in A_{j}$ and $h\in R_{A_{i};A_{j}}$ for each such $j$ . Thus, $h\in R_{A_{i};A_{i+1},\dots,A_{k}}=G_{i}$ and $w_{i}\in A_{i}\cap\dots\cap A_{k}$ .

Next, we show uniqueness. Suppose that $w_{i+1}\in G_{i}w^{\prime}$ for some $w^{\prime}\in A_{i}\cap\dots\cap A_{k}$ . Since $w^{\prime}\in A_{i}$ and $G_{i}\subseteq R_{A_{i};A_{i+1}}$ , by (C5) this implies that $w^{\prime}=w_{i}$ . ∎

Proposition C.6.

Suppose that $\mathcal{A}$ satisfies (C3’). Then for a stable list $H=(A_{1},\dots,A_{k})$ , any $u\in U$ , and any $\pi\in S_{k}$ , we have $R_{H}u=R_{\pi H}u$ .

Proof.

Since we can generate any permutation $\pi$ by swapping adjacent elements, it suffices to show this holds when $\pi=(j\ \ \ j+1)$ for some $j<k$ .

Let $r=h_{k}\cdots h_{1}\in R_{H}$ wherein each $h_{i}\in R_{A_{i};A_{i+1},\dots,A_{k}}$ . Define $u^{\prime}=h_{j-1}\dots h_{1}u$ . Note that $u^{\prime}\in A_{j}\cap A_{j+1}$ . By (C3’) applied to events $A_{j},A_{j+1}$ , there exist $h_{j}^{\prime}\in R_{A_{j}},h_{j+1}^{\prime}\in R_{A_{j+1};A_{j}}$ with $h^{\prime}_{j}h^{\prime}_{j+1}u^{\prime}=h_{j+1}h_{j}u^{\prime}$ . Since $h_{j+1}h_{j}u^{\prime}\in A_{j+2}\cap\dots\cap A_{k}$ , it must be the case that $h^{\prime}_{j}\in R_{A_{j};A_{j+2},\dots,A_{k}}$ and $h^{\prime}_{j+1}\in R_{A_{j+1};A_{j},A_{j+2},\dots,A_{k}}$ .

Now set $r^{\prime}=h_{k}h_{k-1}\dots h_{j+2}h_{j}^{\prime}h_{j+1}^{\prime}h_{j-1}\dots h_{1}$ . We thus have shown that $r^{\prime}\in R_{\pi H}$ . Furthermore, we have $ru=h_{k}\dots h_{1}u=h_{k}\dots h_{j+2}h_{j}^{\prime}h^{\prime}_{j+1}h_{j-1}\dots h_{1}u=r^{\prime}u$ . ∎

Proposition C.7.

If $\mathcal{A}$ satisfies (C3’), then $\overline{\mathcal{A}}$ satisfies (C3’).

Proof.

Consider events $C_{1}=\langle A_{1},\dots,A_{k}\rangle$ and $C_{2}=\langle A_{k+1},\dots,A_{\ell}\rangle$ and any $u\in C_{1}\cap C_{2}$ . By symmetry, it suffices to show that for any $r_{1}\in R_{C_{1};C_{2}},r_{2}\in R_{C_{2}}$ there are $r_{1}^{\prime}\in R_{C_{1}},r_{2}^{\prime}\in R_{C_{2};C_{1}}$ with $r_{2}r_{1}u=r_{1}^{\prime}r_{2}^{\prime}u$ .

Define $H=(A_{1},\dots,A_{\ell})$ . By definition of $R_{C_{2}}$ , we have $r_{2}=h_{\ell}\cdots h_{k+1}$ where $h_{i}\in R_{A_{i};A_{i+1},\dots,A_{\ell}}$ for $i=k+1,\dots,\ell$ . By Proposition C.3, we have $r_{1}=h_{k}\cdots h_{1}$ where $h_{i}\in R_{A_{i};A_{i+1},\dots,A_{k},A_{k+1},\dots,A_{\ell}}$ for $i=1,\dots,k$ . Thus, we see that $r_{2}r_{1}\in R_{H}$ .

Now define $H^{\prime}=(A_{k+1},\dots,A_{\ell},A_{1},\dots,A_{k})$ and note that $H^{\prime}$ is a rearrangement of the list $H$ . By Proposition C.6, this implies that there exists $r^{\prime}\in R_{H^{\prime}}$ such that $r^{\prime}u=r_{2}r_{1}u$ . We can write $r^{\prime}=h_{k}^{\prime}\dots h_{1}^{\prime}h_{\ell}^{\prime}\dots h_{k+1}^{\prime}$ , wherein $h^{\prime}_{i}\in R_{A_{i};A_{1},\dots,A_{k},A_{i+1},\dots,A_{\ell}}$ for $i=k+1,\dots,\ell$ , and $h^{\prime}_{i}\in R_{A_{i};A_{i+1},\dots,A_{k}}$ for $i=1,\dots,k$ . If we set $r^{\prime}_{1}=h^{\prime}_{k}\dots h^{\prime}_{1}$ and $r_{2}^{\prime}=h^{\prime}_{\ell}\dots h^{\prime}_{k+1}$ , then $r_{1}^{\prime}\in R_{C_{1}}$ and by Proposition C.3 we have $r_{2}^{\prime}\in R_{C_{2};C_{1}}$ . We then have $r_{2}r_{1}u=r_{1}^{\prime}r_{2}^{\prime}u$ as desired. ∎

Appendix D Proof of Theorem 3.1

Consider a directed graph $G=(V,E)$ , with a permutation $\pi:[n]\rightarrow V$ chosen uniformly at random. Let $G^{\pi}$ denote the directed acyclic graph on vertex set $V$ and edge-set $\{(u,v)\mid(u,v)\in E,\pi^{-1}(u)<\pi^{-1}(v)\}$ . Let $I^{\pi}$ denote the LFMIS of $G$ with respect to $\pi$ . For any integer $j\in\{0,\dots,n\}$ , define the partial LFMIS $I^{\pi}_{j}=I^{\pi}\cap\{\pi(1),\dots,\pi(j)\}$ . For integers $0\leq i\leq j\leq n$ , define the residual vertex set $V^{\pi}_{(i,j]}=\{\pi(i+1),\dots,\pi(j)\}-I^{\pi}_{i}-\bigcup_{v\in I^{\pi}_{i}}N^{\text{out}}(v)$ and define $G^{\pi}_{(i,j]}$ to be the induced subgraph $G^{\pi}[V^{\pi}_{(i,j]}]$ .

For the purpose of analysis, it will be useful to consider a slowed-down variant of Algorithm 6 called SLOW-GREEDY, as discussed in [6]. Given integers $n_{0},n_{1},\dots,n_{k}$ , it is defined as follows:

We refer to the $i^{\text{th}}$ iteration of the loop in line (2) as epoch $i$ . We make the following observations for Algorithm 9; since the proofs are completely analogous to the undirected case, we refer the reader to [6] for full proof details.

Proposition D.1 ([6]).

For any integers $n_{0},n_{1},\dots,n_{k}$ with $0=n_{0}\leq n_{1}\leq n_{2}\leq\dots\leq n_{k}=n$ , we have the following:

1. SLOW-GREEDY computes the LFMIS of $G$ with respect to $\pi$ .

2. The number of rounds in Algorithm 6 on $G$ and $\pi$ is at most the total number of rounds in SLOW-GREEDY.

3. If all directed paths in $G^{\pi}_{(n_{i-1},n_{i}]}$ have length at most $\ell$ , then epoch $i$ of SLOW-GREEDY terminates in at most $\ell$ rounds.

Algorithm 6 can be viewed as a special case of SLOW-GREEDY with $n_{0}=0,n_{1}=n,k=1$ ; in particular, this shows that Algorithm 6 correctly computes the LFMIS of $G$ with respect to $\pi$ .
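As an illustration of Proposition D.1, the following Python sketch compares a sequential greedy LFMIS with a round-based procedure run in epochs. It rests on assumptions: the listings of Algorithm 6 and SLOW-GREEDY are not reproduced in this appendix, so the round rule used here (every alive vertex with no alive earlier in-neighbor joins the set, and it and its later out-neighbors are removed) is our reading of the greedy scheme of [6]. The function names and the `cuts` list, which plays the role of $n_{0},\dots,n_{k}$ , are hypothetical.

```python
def forward_edges(n, edges, order):
    """Adjacency of G^pi: keep only edges going from an earlier to a later vertex."""
    pos = {v: i for i, v in enumerate(order)}
    fwd_in = {v: set() for v in range(n)}
    fwd_out = {v: set() for v in range(n)}
    for u, v in edges:
        if pos[u] < pos[v]:
            fwd_out[u].add(v)
            fwd_in[v].add(u)
    return fwd_in, fwd_out


def lfmis_sequential(n, edges, order):
    """Scan vertices in order; an alive vertex joins the set and kills its later out-neighbors."""
    _, fwd_out = forward_edges(n, edges, order)
    alive, chosen = set(range(n)), set()
    for v in order:
        if v in alive:
            chosen.add(v)
            alive -= fwd_out[v] | {v}
    return chosen


def slow_greedy(n, edges, order, cuts):
    """Round-based greedy run in epochs; cuts = [n_0, ..., n_k] with n_0 = 0 and n_k = n.

    Returns the chosen set and the total number of rounds over all epochs.
    """
    fwd_in, fwd_out = forward_edges(n, edges, order)
    alive, chosen, rounds = set(range(n)), set(), 0
    for lo, hi in zip(cuts, cuts[1:]):
        window = set(order[lo:hi]) & alive  # alive vertices of this epoch
        while window:
            # roots: window vertices with no alive earlier in-neighbor
            roots = {v for v in window if not (fwd_in[v] & alive)}
            chosen |= roots
            dead = set(roots)
            for v in roots:
                dead |= fwd_out[v]
            alive -= dead
            window -= dead
            rounds += 1
    return chosen, rounds


edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (0, 3)]
order = [0, 1, 2, 3, 4, 5]
print(lfmis_sequential(6, edges, order))
print(slow_greedy(6, edges, order, [0, 6]))        # one epoch (the special case k = 1)
print(slow_greedy(6, edges, order, [0, 2, 4, 6]))  # three epochs
```

On this small example all three calls select the same vertex set, and the single-epoch run uses no more rounds than the three-epoch run, in line with items 1 and 2 of Proposition D.1.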

We now analyze the path lengths in the subgraphs $G^{\pi}_{(i,j]}$ . For $i=0,\dots,n$ , let us define

$$D_{i}=\max_{v\in V^{\pi}_{(i,n]}}\bigl|N^{\text{in}}(v)\cap V^{\pi}_{(i,n]}\bigr|.$$

Proposition D.2.

With probability at least $1-n^{-100}$ , we have $D_{i}\leq\frac{200n\log n}{i}$ for any $i=1,\dots,n$ .

Proof.

Let us fix some vertex $v$ , and we want to show that either $v\notin V_{(i,n]}$ or $|N^{\text{in}}(v)\cap V_{(i,n]}|\leq d$ for $d=\frac{200n\log n}{i}$ . For each $k=1,\dots,n$ define $\mathcal{E}_{k}$ to be the event that $v$ is alive and has at least $d$ alive in-neighbors after step $k$ of Algorithm 5.

We compute the probability of $\mathcal{E}_{k}$ conditional on $\mathcal{E}_{1},\dots,\mathcal{E}_{k-1}$ . As $\mathcal{E}_{1},\dots,\mathcal{E}_{k-1}$ are determined by $\pi(1),\dots,\pi(k-1)$ , it suffices to compute the probability of $\mathcal{E}_{k}$ conditional on $\pi(1),\dots,\pi(k-1)$ . This allows us to determine the set $A^{\prime}=A\cap N^{\text{in}}(v)$ of alive in-neighbors of $v$ after step $k-1$ . If $|A^{\prime}|<d$ , then $\mathcal{E}_{k}$ is false. Otherwise, we have $\pi(k)\in A^{\prime}$ with probability at least $\frac{d}{n-k+1}$ , in which case $v$ is removed from $A$ after iteration $k$ and $\mathcal{E}_{k}$ is false. Thus, $\Pr(\mathcal{E}_{k}\mid\mathcal{E}_{1},\dots,\mathcal{E}_{k-1})\leq 1-\frac{d}{n-k+1}$ . This implies that

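$$\Pr(\mathcal{E}_{i})=\Pr(\mathcal{E}_{1}\wedge\dots\wedge\mathcal{E}_{i})\leq\prod_{k=1}^{i}\Bigl(1-\frac{d}{n-k+1}\Bigr)\leq e^{-di/n}=e^{-200\log n}=n^{-200}.$$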

By definition $V_{(i,n]}^{\pi}$ contains only vertices which are alive after iteration $i$ . Thus, if $\mathcal{E}_{i}$ is false, the desired property holds for $v$ and $i$ . To finish, we take a union bound over all $n^{2}$ pairs $v,i$ . ∎

Proposition D.3.

Suppose that we condition on $\pi(1),\dots,\pi(i)$ , and let $s=D_{i}j/n$ , and let $L$ denote the length of the longest path in $G^{\pi}_{(i,j]}$ . Then, with probability at least $1-n^{-5}$ , it holds that

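$$L\leq\begin{cases}O(s)&\text{if }s>\log n\\ O\Bigl(\frac{\log n}{\log\frac{2\log n}{s}}\Bigr)&\text{if }s\leq\log n\end{cases}$$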

Proof.

Consider the induced graph $H=G[V^{\pi}_{(i,n]}]$ , which depends only on the values $\pi(1),\dots,\pi(i)$ . Let $d=D_{i}$ be the maximum in-degree of $H$ . We can enumerate the paths of $H$ with $k$ vertices by choosing the final vertex in the path ( $n$ choices), and then each of the $k-1$ previous vertices in the path (at most $d$ choices each), so the number of such paths in $H$ is at most $n\times d^{k-1}$ .

A necessary condition for a path $v_{1},\dots,v_{k}$ to survive to $G^{\pi}_{(i,j]}$ is that $\pi^{-1}(v_{1})<\pi^{-1}(v_{2})<\dots<\pi^{-1}(v_{k})\leq j$ . Having conditioned on $\pi(1),\dots,\pi(i)$ , this event has probability

[TABLE]

Taking a union-bound over all such paths, we have

[TABLE]

If $s>\log n$ , then note that for $k=2es$ this is at most $2^{-k}\leq n^{-10}$ . If $s\leq\log n$ , then set $x=\frac{2\log n}{s}\geq 2$ and $k=\frac{10\log n}{\log x}\leq O(s)$ ; we then have $(es/k)^{k}=\exp\Bigl{(}\frac{-10\log n}{\log x}\times\log\bigl{(}\frac{10\log n}{es\log x}\bigr{)}\Bigr{)}=\exp\Bigl{(}\frac{-10\log n}{\log x}\times\log\bigl{(}\frac{5x}{e\log x}\bigr{)}\Bigr{)}$ . Standard analysis shows that $\log\bigl{(}\frac{5x}{e\log x}\bigr{)}\geq 0.5\log x$ for all $x>1$ (indeed $e\log x\leq 2\sqrt{x}$ , so $\frac{5x}{e\log x}\geq\frac{5}{2}\sqrt{x}$ ). Thus, this is at most $\exp\bigl{(}\frac{-10\log n}{\log x}\times 0.5\log x\bigr{)}=e^{-5\log n}=n^{-5}$ . ∎
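The elementary inequality $\log\bigl(\frac{5x}{e\log x}\bigr)\geq 0.5\log x$ used in the last step can be sanity-checked numerically. The short Python sketch below evaluates the gap between the two sides on a geometric grid; the grid and the name `gap` are illustrative choices, and a numerical sweep is of course not a substitute for the analytic argument.

```python
import math


def gap(x):
    """Left side minus right side: log(5x / (e log x)) - 0.5 log x, for x > 1."""
    return math.log(5 * x / (math.e * math.log(x))) - 0.5 * math.log(x)


# Geometric grid covering x from 1.001 up to about 10^6.
grid = [1 + 10 ** (t / 100) for t in range(-300, 601)]
min_gap = min(gap(x) for x in grid)
print(min_gap)
```

The gap equals $\log\bigl(\frac{5\sqrt{x}}{e\log x}\bigr)$ , which is smallest near $x=e^{2}$ , where it is $\log 2.5$ ; the grid minimum stays just above that value.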

We are now ready to bound the runtime. We show a slightly tighter bound in terms of the maximum in-degree of graph $G$ .

Theorem D.4.

Let $d=\max_{v\in G}|N^{\text{in}}(v)|$ . When $\pi$ is chosen uniformly at random, then:

1. For $d\leq\log n$ , Algorithm 6 takes $O\Bigl{(}\frac{\log n}{\log\tfrac{2\log n}{d}}\Bigr{)}$ rounds whp.

2. For $d>\log n$ , Algorithm 6 takes $O(\log n\log\tfrac{2d}{\log n})$ rounds whp.

In particular, Algorithm 6 takes $O(\log d\log n)\leq O(\log^{2}n)$ rounds whp.

Proof.

1. By Proposition D.3 applied at $i=0,j=n$ , whp the graph $G^{\pi}_{(0,n]}$ has maximum path length $O(\frac{\log n}{\log\frac{2\log n}{s}})$ where $s=D_{0}\leq d\leq\log n$ . By Proposition D.1, this implies that Algorithm 6 terminates in $O(\frac{\log n}{\log\tfrac{2\log n}{d}})$ rounds whp.

2. We will use Proposition D.1 with parameters $k=\lceil\log_{2}\frac{4d}{\log n}\rceil$ and $n_{j}=\min(n,\frac{2^{j}n\log n}{d})$ for $j=1,\dots,k$ and $n_{0}=0$ . Note that $n_{k}=n$ as required, since $\frac{2^{k}\log n}{d}\geq\frac{4d}{\log n}\times\frac{\log n}{d}\geq 4$ .

Define $s_{i}=D_{n_{i-1}}n_{i}/n$ for $i=1,\dots,k$ . For $i=1$ , we have $s_{i}\leq dn_{1}/n\leq d\times\frac{2n\log n}{nd}=2\log n$ . For $i\geq 2$ , Proposition D.2 shows that $D_{n_{i-1}}\leq\frac{200nd\log n}{2^{i-1}n\log n}=O(d/2^{i})$ with probability at least $1-n^{-100}$ , in which case $s_{i}\leq O(d/2^{i})\times(2^{i}n\log n/d)/n=O(\log n)$ . When these events occur, then by Proposition D.3, each graph $G^{\pi}_{(n_{i-1},n_{i}]}$ for $i\geq 1$ has maximum path length $O(\log n)$ with probability at least $1-n^{-5}$ .

By Proposition D.1, these facts imply that, whp, each epoch of SLOW-GREEDY takes $O(\log n)$ rounds. Overall, the total number of rounds over all $k$ epochs is $O(k\log n)=O(\log n\log\tfrac{2d}{\log n})$ . ∎
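The epoch boundaries used in part 2 of the proof can be tabulated directly. The following Python sketch computes $k=\lceil\log_{2}\frac{4d}{\log n}\rceil$ and $n_{j}=\min(n,\frac{2^{j}n\log n}{d})$ for given $n$ and $d$ . The function name `epoch_schedule` is hypothetical, and rounding $n_{j}$ down to an integer is our assumption, since the proof does not discuss rounding.

```python
import math


def epoch_schedule(n, d):
    """Return [n_0, n_1, ..., n_k] for the case d > log n of Theorem D.4."""
    log_n = math.log(n)
    if d <= log_n:
        raise ValueError("this schedule is only used when d > log n")
    k = math.ceil(math.log2(4 * d / log_n))
    # n_j doubles each epoch until it is capped at n (rounded down: an assumption).
    return [0] + [min(n, math.floor(2 ** j * n * log_n / d)) for j in range(1, k + 1)]


schedule = epoch_schedule(1024, 100)
print(schedule)
```

Because $2^{k}\log n/d\geq 4$ , the last boundary is always capped at $n$ , so the schedule ends at $n_{k}=n$ as Proposition D.1 requires.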

Appendix E Hamiltonian cycles of $K_{n}$

In order to use algebraic tools, we encode a Hamiltonian cycle $(x_{1},\dots,x_{n},x_{1})$ of $K_{n}$ as the permutation $\pi=(x_{1}\ x_{2}\ x_{3}\dots\ x_{n})$ . In this way, the ground set $U$ can be viewed as the set of permutations $\pi$ consisting of precisely one cycle of length $n$ . We define $R$ to be the group $S_{n}$ with the natural group action of left-multiplication on $U$ ; thus properties (D0), (D2), (D3) are trivial.

For any sequence of distinct values $x_{1},\dots,x_{k}$ , let us define the set of permutations

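$$T(x_{1},\dots,x_{k})=\bigl\{(x_{k}\ z_{k})(x_{k-1}\ z_{k-1})\cdots(x_{1}\ z_{1})\mid z_{i}\in[n]-\{x_{i},\dots,x_{k}\}\bigr\}.$$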

Note that each choice of the values $z_{1},\dots,z_{k}$ gives rise to a distinct permutation. Thus, $|T(x_{1},\dots,x_{k})|=\frac{(n-1)!}{(n-k-1)!}$ .
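The size formula can be checked by brute force for small parameters. The Python sketch below assumes that $T(x_{1},\dots,x_{k})$ consists of the products $(x_{k}\ z_{k})\cdots(x_{1}\ z_{1})$ with $z_{i}\in[n]-\{x_{i},\dots,x_{k}\}$ , which is the form used in the proofs of this appendix. Permutations are 0-indexed tuples, and the helper names are illustrative.

```python
from itertools import product
from math import factorial


def transposition(n, a, b):
    """The permutation of {0, ..., n-1} swapping a and b (identity if a == b)."""
    p = list(range(n))
    p[a], p[b] = p[b], p[a]
    return tuple(p)


def compose(f, g):
    """Return f∘g: apply g first, then f."""
    return tuple(f[g[i]] for i in range(len(g)))


def T(n, xs):
    """All products (x_k z_k)...(x_1 z_1) with z_i outside {x_i, ..., x_k}."""
    choices = [[z for z in range(n) if z not in xs[i:]] for i in range(len(xs))]
    result = set()
    for zs in product(*choices):
        sigma = tuple(range(n))
        for x, z in zip(xs, zs):  # (x_1 z_1) is applied first
            sigma = compose(transposition(n, x, z), sigma)
        result.add(sigma)
    return result


def expected_size(n, k):
    return factorial(n - 1) // factorial(n - k - 1)


print(len(T(5, (0, 1))), expected_size(5, 2))
print(len(T(6, (0, 1, 2))), expected_size(6, 3))
```

Collecting the products in a set means the count only matches the formula if distinct choices of $z_{1},\dots,z_{k}$ really do give distinct permutations.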

We are now ready to define the resampling-space itself. Let $Q$ be the set of paths $q=(x_{1},\dots,x_{k})$ where $x_{1},\dots,x_{k}$ are distinct elements of $[n]$ . We define the support of the path $q$ by $\sup(q)=\{x_{1},\dots,x_{k}\}$ . For such a path $q$ , define an atomic event

$$\langle q\rangle=\{\pi\in U\mid\pi(x_{1})=x_{2},\ \pi(x_{2})=x_{3},\ \dots,\ \pi(x_{k-1})=x_{k}\}.$$

We define the dependency relation by setting $\langle q\rangle\sim\langle q^{\prime}\rangle$ if $\sup(q)\cap\sup(q^{\prime})\neq\emptyset$ .

For a given set $X\subseteq[n]$ , let us define $U_{X}$ to be the set of permutations in $S_{n}$ whose cycle structure consists of fixed points at each $x\in X$ , along with a single cycle on $[n]-X$ . Note that $U=U_{\emptyset}$ . There is an important permutation which “normalizes” the path $q=(x_{1},\dots,x_{k})$ , namely

$$\lambda_{q}=(x_{k}\ x_{k-1}\ \dots\ x_{1}).$$

For $q=(x_{1},\dots,x_{k})$ , we define $\Gamma_{\langle q\rangle}$ to be the uniform distribution on $T(x_{1},\dots,x_{k-1})\lambda_{q}$ . The following observations explain the role of $\lambda_{q}$ :

Observation E.1.

For $\pi\in S_{n}$ and path $q=(x_{1},\dots,x_{k})$ , we have $\pi\in\langle q\rangle$ iff $\lambda_{q}\pi\in U_{\{x_{1},\dots,x_{k-1}\}}$ .

Proposition E.2.

Let $q=(x_{1},\dots,x_{k})$ and $A=\langle q\rangle$ . For $\pi\in A$ and $\sigma\lambda_{q}\in R_{A}$ , we have $\sigma\lambda_{q}\pi\in U$ .

Proof.

Let $\sigma=(x_{k-1}\ z_{k-1})\cdots(x_{1}\ z_{1})$ where $z_{i}\in[n]-\{x_{i},\dots,x_{k-1}\}$ , and define $\tau_{i}=(x_{i}\ z_{i})\cdots(x_{1}\ z_{1})\lambda_{q}\pi$ for $i=0,\dots,k-1$ . We show by induction on $i$ that $\tau_{i}\in U_{\{x_{i+1},\dots,x_{k-1}\}}$ . The base case at $i=0$ is precisely Observation E.1 since $\tau_{0}=\lambda_{q}\pi$ , and the case at $i=k-1$ is what we are trying to show since $\sigma\lambda_{q}\pi=\tau_{k-1}$ and $U_{\emptyset}=U$ .

For the induction step, we have $\tau_{i}=(x_{i}\ z_{i})\tau_{i-1}$ . The point $x_{i}$ does not appear in the cycle of $\tau_{i-1}$ by induction hypothesis. However, since $z_{i}\in[n]-\{x_{i},\dots,x_{k-1}\}$ , the point $z_{i}$ does so. Thus $\tau_{i}$ has $x_{i}$ inserted just before $z_{i}$ in its cycle, moving $x_{i}$ from a fixed point to part of its cycle. ∎

We now show that the necessary properties are satisfied.

Proposition E.3.

Properties (C5) and (C1) hold.

Proof.

Consider $A=\langle q\rangle$ for a path $q=(x_{1},\dots,x_{k})$ and let $\rho\in U$ . We claim that there is precisely one choice for the ordered pair $(\sigma,\pi)$ with $\sigma\in T(x_{1},\dots,x_{k-1})$ and $\pi\in A$ such that $\rho=\sigma\lambda_{q}\pi$ .

Since $\pi$ is uniquely determined from $\rho,\sigma$ , we will show that there is precisely one choice for $\sigma$ such that $\sigma^{-1}\rho\in\lambda_{q}A$ . By Observation E.1, this is equivalent to showing $\sigma^{-1}\rho\in U_{\{x_{1},\dots,x_{k-1}\}}$ .

Consider $\sigma=(x_{k-1}\ z_{k-1})\dots(x_{1}\ z_{1})$ where $z_{i}\in[n]-\{x_{i},\dots,x_{k-1}\}$ . We want to show that there is a unique choice for indices $z_{1},\dots,z_{k-1}$ such that $\sigma^{-1}\rho=(x_{1}\ z_{1})\dots(x_{k-1}\ z_{k-1})\rho$ is in $U_{\{x_{1},\dots,x_{k-1}\}}$ .

It suffices to show that for any index $j=k-1,\dots,1$ and $\tau\in U_{\{x_{j+1},\dots,x_{k-1}\}}$ , there is a unique choice for $z_{j}$ such that $(x_{j}\ z_{j})\tau\in U_{\{x_{j},\dots,x_{k-1}\}}$ . Since $\tau\in U_{\{x_{j+1},\dots,x_{k-1}\}}$ , the element $x_{j}$ appears in the full cycle, followed by some $y\notin\{x_{j+1},\dots,x_{k-1}\}$ . Now note that $(x_{j}\ z_{j})\tau$ has an additional fixed point at $x_{j}$ precisely if $z_{j}=y$ . Thus there is precisely one choice of $z_{j}$ with $(x_{j}\ z_{j})\tau\in U_{\{x_{j},\dots,x_{k-1}\}}$ .

This shows the claim and immediately gives (C5). For (C1), note that for any $\rho\in U$ , the probability of $\rho=\sigma\lambda_{q}\pi$ , where $\sigma$ is drawn uniformly from $T(x_{1},\dots,x_{k-1})$ and $\pi$ is drawn uniformly from $A$ , is precisely $\frac{1}{|T(x_{1},\dots,x_{k-1})|}\times\frac{1}{|A|}=\frac{(n-k)!}{(n-1)!}\times\frac{1}{(n-k)!}=\frac{1}{(n-1)!}$ . ∎
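The unique-factorisation claim behind (C1) and (C5) can be confirmed exhaustively for small $n$ . The Python sketch below enumerates all pairs $(\sigma,\pi)\in T(x_{1},\dots,x_{k-1})\times A$ and counts how often each full cycle $\rho$ arises as $\sigma\lambda_{q}\pi$ . It assumes that $\lambda_{q}$ is the cycle $(x_{k}\ x_{k-1}\ \dots\ x_{1})$ and that $\langle q\rangle$ is the set of full cycles with $\pi(x_{i})=x_{i+1}$ ; vertices are 0-indexed and all function names are illustrative.

```python
from collections import Counter
from itertools import permutations, product


def transposition(n, a, b):
    p = list(range(n))
    p[a], p[b] = p[b], p[a]
    return tuple(p)


def compose(f, g):
    """Return f∘g: apply g first, then f."""
    return tuple(f[g[i]] for i in range(len(g)))


def cycle(n, elems):
    """The cycle (e_0 e_1 ... e_m) sending each element to the next one."""
    p = list(range(n))
    for a, b in zip(elems, elems[1:] + elems[:1]):
        p[a] = b
    return tuple(p)


def is_full_cycle(p):
    """True if p consists of a single cycle through all points."""
    steps, v = 0, 0
    while True:
        v = p[v]
        steps += 1
        if v == 0:
            return steps == len(p)


def factorisation_counts(n, q):
    """Count, for each rho, the pairs (sigma, pi) with rho = sigma * lambda_q * pi."""
    k = len(q)
    U = [p for p in permutations(range(n)) if is_full_cycle(p)]
    A = [p for p in U if all(p[q[i]] == q[i + 1] for i in range(k - 1))]
    lam = cycle(n, q[::-1])  # (x_k x_{k-1} ... x_1)
    xs = q[:-1]
    choices = [[z for z in range(n) if z not in xs[i:]] for i in range(len(xs))]
    counts = Counter()
    for zs in product(*choices):
        sigma = tuple(range(n))
        for x, z in zip(xs, zs):
            sigma = compose(transposition(n, x, z), sigma)
        for pi in A:
            counts[compose(sigma, compose(lam, pi))] += 1
    return U, counts


U, counts = factorisation_counts(5, (0, 1, 2))
print(len(U), len(counts), set(counts.values()))
```

If every full cycle appears with multiplicity exactly one, the map $(\sigma,\pi)\mapsto\sigma\lambda_{q}\pi$ is a bijection onto $U$ , which is what both (C5) and the uniformity in (C1) rely on.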

Proposition E.4.

Property (C2) holds.

Proof.

Consider $A=\langle q\rangle$ for $q=(x_{1},\dots,x_{k})$ and $A^{\prime}=\langle q^{\prime}\rangle$ for $q^{\prime}=(y_{1},\dots,y_{j})$ with $A\not\sim A^{\prime}$ and $\pi\in A-A^{\prime}$ . There must exist some index $\ell<j$ with $\pi(y_{\ell})\neq y_{\ell+1}$ .

Let $\sigma\in T(x_{1},\dots,x_{k-1})$ . We claim that $\sigma\lambda_{q}\pi y_{\ell}\neq y_{\ell+1}$ so that $\sigma\lambda_{q}\pi\notin A^{\prime}$ .

To show this, define $\tau_{i}=(x_{i}\ z_{i})\cdots(x_{1}\ z_{1})\lambda_{q}\pi$ for $i=0,\dots,k-1$ , wherein $z_{m}\in[n]-\{x_{m},\dots,x_{k-1}\}$ for each $m$ . Suppose that $i$ is minimal such that $\tau_{i}y_{\ell}=y_{\ell+1}$ . It cannot be $i=0$ : we have $\lambda_{q}y_{\ell+1}=y_{\ell+1}$ (since $y_{\ell+1}\notin\sup(q)$ ), so $\tau_{0}y_{\ell}=\lambda_{q}\pi y_{\ell}=y_{\ell+1}$ would force $\pi y_{\ell}=y_{\ell+1}$ .

For this value $i>0$ , it must be either that (a) $x_{i}=\tau_{i-1}y_{\ell},z_{i}=y_{\ell+1}$ or (b) $z_{i}=\tau_{i-1}y_{\ell},x_{i}=y_{\ell+1}$ . The former cannot occur as $\tau_{i-1}x_{i}=x_{i}$ and the latter cannot occur as $x_{i}\neq y_{\ell+1}$ . ∎

Proposition E.5.

Let $q=(x_{1},\dots,x_{k})$ and $b\in[n]-\{x_{1},\dots,x_{k}\}$ . Let $\sigma=(x_{k-1}\ z_{k-1})\cdots(x_{1}\ z_{1})$ where $z_{i}\in[n]-\{x_{i},\dots,x_{k-1}\}$ . Then $\sigma b=b$ iff $z_{1},\dots,z_{k-1}$ are all distinct from $b$ .

Proof.

The reverse direction is immediate. For the forward direction, define $\sigma_{j}=(x_{j}\ z_{j})\cdots(x_{1}\ z_{1})$ for $j=0,\dots,k-1$ and let $i\leq k-1$ be minimal such that $z_{i}=b$ . We show by induction that for $j\geq i$ we have $\sigma_{j}b\in\{x_{1},\dots,x_{k-1}\}$ . For the base case, we have $\sigma_{i}b=(x_{i}\ b)(x_{i-1}\ z_{i-1})\cdots(x_{1}\ z_{1})b=x_{i}$ . For the induction step, suppose that $\sigma_{j-1}b=x_{r}$ . If $z_{j}\neq x_{r}$ we have $\sigma_{j}b=\sigma_{j-1}b=x_{r}$ as desired. If $z_{j}=x_{r}$ , then $\sigma_{j}b=(x_{j}\ x_{r})\sigma_{j-1}b=(x_{j}\ x_{r})x_{r}=x_{j}$ , again as desired.

Thus, if some of the $z_{i}$ are equal to $b$ then $\sigma b\in\{x_{1},\dots,x_{k-1}\}$ , and in particular $\sigma b\neq b$ . ∎
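Proposition E.5 is easy to test exhaustively for small cases. The Python sketch below runs over every admissible choice of $z_{1},\dots,z_{k-1}$ and compares the two sides of the equivalence; indices are 0-based, `xs` stands for $(x_{1},\dots,x_{k-1})$ , and the function names are hypothetical.

```python
from itertools import product


def transposition(n, a, b):
    p = list(range(n))
    p[a], p[b] = p[b], p[a]
    return tuple(p)


def compose(f, g):
    """Return f∘g: apply g first, then f."""
    return tuple(f[g[i]] for i in range(len(g)))


def fixes_iff_avoids(n, xs, b):
    """Check: sigma(b) == b exactly when no z_i equals b, over all valid z."""
    choices = [[z for z in range(n) if z not in xs[i:]] for i in range(len(xs))]
    for zs in product(*choices):
        sigma = tuple(range(n))
        for x, z in zip(xs, zs):  # builds (x_{k-1} z_{k-1})...(x_1 z_1)
            sigma = compose(transposition(n, x, z), sigma)
        if (sigma[b] == b) != (b not in zs):
            return False
    return True


print(all(fixes_iff_avoids(6, (0, 1, 2), b) for b in (3, 4, 5)))
```

Since the criterion refers only to $z_{1},\dots,z_{k-1}$ , it can be evaluated from the resampling action alone without looking at the current state.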

Proposition E.6.

Property (C4) holds. Furthermore, for $A=\langle q\rangle,A^{\prime}=\langle q^{\prime}\rangle$ with $A\not\sim A^{\prime},q=(x_{1},\dots,x_{k}),q^{\prime}=(y_{1},\dots,y_{j})$ , we have

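$$R_{A;A^{\prime}}=\bigl\{(x_{k-1}\ z_{k-1})\cdots(x_{1}\ z_{1})\lambda_{q}\mid z_{i}\in[n]-\{x_{i},\dots,x_{k-1}\}-\{y_{2},\dots,y_{j}\}\bigr\}.$$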

Proof.

Let $\ell<j$ . Consider $\sigma\lambda_{q}\in R_{A}$ with $\sigma=(x_{k-1}\ z_{k-1})\cdots(x_{1}\ z_{1})$ . For $\pi\in A^{\prime}$ , we have $\sigma\lambda_{q}\pi y_{\ell}=\sigma\lambda_{q}y_{\ell+1}=\sigma y_{\ell+1}$ , where the last equality holds because $y_{\ell+1}\notin\sup(q)$ ; by Proposition E.5 this is equal to $y_{\ell+1}$ iff $z_{1},\dots,z_{k-1}$ are distinct from $y_{\ell+1}$ . Thus, $\sigma\lambda_{q}\pi\in A^{\prime}$ iff $z_{1},\dots,z_{k-1}$ are distinct from $y_{2},\dots,y_{j}$ . To show (C4), note that this criterion does not depend on $\pi$ , so it either holds for all $\pi\in A\cap A^{\prime}$ or none of them. ∎

Given any event $A=\langle(x_{1},\dots,x_{k})\rangle$ and stable set $E\not\sim A$ , this result allows us to efficiently draw from $R_{A;E}$ , by selecting indices $z_{1},\dots,z_{k-1}$ wherein each $z_{i}$ is distinct from the tail $y_{2},\dots,y_{j}$ of each $A^{\prime}=\langle(y_{1},\dots,y_{j})\rangle$ in $E$ . In particular, this shows (D1’).

We will now show commutativity. This follows from the observation that $T(x_{1},\dots,x_{k})$ depends only on the unordered set $\{x_{1},\dots,x_{k}\}$ :

Proposition E.7.

For any distinct values $x_{1},\dots,x_{k}$ and any permutation $\pi\in S_{k}$ , we have

$$T(x_{1},\dots,x_{k})=T(x_{\pi 1},\dots,x_{\pi k}).$$

Proof.

It suffices to consider $\pi=(j\ j+1)$ for $j<k$ . Consider $\sigma=(x_{k}\ z_{k})\cdots(x_{1}\ z_{1})$ where $z_{i}\in[n]-\{x_{i},\dots,x_{k}\}$ . We will show that there exist $w_{j},w_{j+1}$ such that $(x_{j}\ w_{j})(x_{j+1}\ w_{j+1})=(x_{j+1}\ z_{j+1})(x_{j}\ z_{j})$ with $w_{j}\notin\{x_{j},x_{j+2},\dots,x_{k}\},w_{j+1}\notin\{x_{j},x_{j+1},x_{j+2},\dots,x_{k}\}$ . In this case, replacing the terms $(x_{j+1}\ z_{j+1})(x_{j}\ z_{j})$ with $(x_{j}\ w_{j})(x_{j+1}\ w_{j+1})$ allows us to swap $x_{j},x_{j+1}$ , showing that $\sigma\in T(x_{1},x_{2},\dots,x_{j-1},x_{j+1},x_{j},x_{j+2},\dots,x_{k})$ . There are a few cases.

1. If all four values $z_{j},z_{j+1},x_{j},x_{j+1}$ are distinct, then the two transpositions are disjoint and commute, i.e. $(x_{j+1}\ z_{j+1})(x_{j}\ z_{j})=(x_{j}\ z_{j})(x_{j+1}\ z_{j+1})$ , and so $w_{j}=z_{j},w_{j+1}=z_{j+1}$ works.

2. If $z_{j}=z_{j+1}=z$ , then $(x_{j+1}\ z_{j+1})(x_{j}\ z_{j})=(x_{j}\ x_{j+1}\ z)=(x_{j}\ x_{j+1})(x_{j+1}\ z)$ . Thus taking $w_{j}=x_{j+1}$ and $w_{j+1}=z$ works.

3. If $z_{j+1}=x_{j}$ , then $(x_{j+1}\ z_{j+1})(x_{j}\ z_{j})=(x_{j}\ z_{j}\ x_{j+1})=(x_{j}\ z_{j})(x_{j+1}\ z_{j})$ . Thus taking $w_{j}=z_{j},w_{j+1}=z_{j}$ works. ∎
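The three transposition identities used in this case analysis can be verified mechanically. The Python sketch below checks them for every assignment of distinct values in $\{0,\dots,n-1\}$ ; products are composed right to left, matching the convention of the text, and the names `a`, `b` (standing for $x_{j},x_{j+1}$ ), `z`, `w` and the helper functions are illustrative.

```python
from itertools import permutations


def transposition(n, a, b):
    p = list(range(n))
    p[a], p[b] = p[b], p[a]
    return tuple(p)


def compose(f, g):
    """Return f∘g: apply g first, then f."""
    return tuple(f[g[i]] for i in range(len(g)))


def swap_identities_hold(n):
    """Check cases 1-3 for all distinct a = x_j, b = x_{j+1}, z, w in range(n)."""
    def t(u, v):
        return transposition(n, u, v)

    for a, b, z, w in permutations(range(n), 4):
        # Case 1: (b w)(a z) = (a z)(b w) for four distinct values.
        if compose(t(b, w), t(a, z)) != compose(t(a, z), t(b, w)):
            return False
        # Case 2: (b z)(a z) = (a b)(b z), i.e. w_j = x_{j+1}, w_{j+1} = z.
        if compose(t(b, z), t(a, z)) != compose(t(a, b), t(b, z)):
            return False
        # Case 3: (b a)(a z) = (a z)(b z), i.e. w_j = w_{j+1} = z_j.
        if compose(t(b, a), t(a, z)) != compose(t(a, z), t(b, z)):
            return False
    return True


print(swap_identities_hold(5))
```

The three cases are exhaustive because $z_{j}\notin\{x_{j},x_{j+1}\}$ and $z_{j+1}\neq x_{j+1}$ , so the only possible coincidences are $z_{j}=z_{j+1}$ and $z_{j+1}=x_{j}$ .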

Proposition E.8.

Property (C3’) holds.

Proof.

Let $A_{1}=\langle q_{1}\rangle,A_{2}=\langle q_{2}\rangle$ where $q_{1}=(x_{1},\dots,x_{k}),q_{2}=(b_{1},\dots,b_{\ell})$ with $A_{1}\not\sim A_{2}$ . We will show that

$$R_{A_{2}}R_{A_{1};A_{2}}=T(H)\lambda_{q}\lambda_{q^{\prime}}\qquad(3)$$

where we define $H=(x_{1},\dots,x_{k-1},b_{1},\dots,b_{\ell-1})$ and, here and below, abbreviate $q=q_{1}$ , $q^{\prime}=q_{2}$ , $A=A_{1}$ , $A^{\prime}=A_{2}$ . Note that $\lambda_{q}$ and $\lambda_{q^{\prime}}$ commute since $A_{1}\not\sim A_{2}$ , and by Proposition E.7 the set $T(H)$ does not depend upon the ordering of the list $H$ , and so by symmetry this will then show that $R_{A_{2}}R_{A_{1};A_{2}}=T(H)\lambda_{q}\lambda_{q^{\prime}}=R_{A_{1}}R_{A_{2};A_{1}}$ as desired.

Since $A_{1}\not\sim A_{2}$ , the values $b_{1},\dots,b_{\ell}$ are distinct from $x_{1},\dots,x_{k}$ . We have $|R_{A_{2}}|=\frac{(n-1)!}{(n-\ell)!}$ and $|T(H)|=\frac{(n-1)!}{(n-1-(\ell+k-2))!}$ . Using the explicit description of $R_{A_{1};A_{2}}$ from Proposition E.6, we calculate $|R_{A_{1};A_{2}}|=\frac{(n-1-(\ell-1))!}{(n-1-(\ell-1)-(k-1))!}$ . Thus $|R_{A_{2}}|\times|R_{A_{1};A_{2}}|=|T(H)|$ . We will show that $T(H)\lambda_{q}\lambda_{q^{\prime}}\subseteq R_{A^{\prime}}R_{A;A^{\prime}}$ ; a counting argument then shows Eq. (3).

Consider $\tau\in T(H)$ of the form

$$\tau=(b_{\ell-1}\ c_{\ell-1})\cdots(b_{1}\ c_{1})(x_{k-1}\ z_{k-1})\cdots(x_{1}\ z_{1})$$

where $z_{i}\notin\{x_{i},\dots,x_{k-1},b_{1},\dots,b_{\ell-1}\}$ and $c_{i}\notin\{b_{i},\dots,b_{\ell-1}\}$ .

For any value $z\notin\{b_{1},\dots,b_{\ell}\}$ , we have $\lambda_{q^{\prime}}(x_{i}\ z)=(x_{i}\ z)\lambda_{q^{\prime}}$ , since $\lambda_{q^{\prime}}$ moves only $b_{1},\dots,b_{\ell}$ . For $z=b_{1}$ , we have $\lambda_{q^{\prime}}(x_{i}\ b_{1})=(x_{i}\ b_{\ell}\dots b_{1})=(x_{i}\ b_{\ell})\lambda_{q^{\prime}}$ . Since $z_{i}\notin\{b_{1},\dots,b_{\ell-1}\}$ , this shows that $\lambda_{q^{\prime}}(x_{k-1}\ z_{k-1}^{\prime})\cdots(x_{1}\ z_{1}^{\prime})=(x_{k-1}\ z_{k-1})\cdots(x_{1}\ z_{1})\lambda_{q^{\prime}}$ , where $z^{\prime}_{i}$ is defined as

$$z^{\prime}_{i}=\begin{cases}b_{1}&\text{if }z_{i}=b_{\ell}\\ z_{i}&\text{otherwise}\end{cases}$$

So we have shown that $\tau\lambda_{q}\lambda_{q^{\prime}}=(b_{\ell-1}\ c_{\ell-1})\cdots(b_{1}\ c_{1})\lambda_{q^{\prime}}(x_{k-1}\ z_{k-1}^{\prime})\cdots(x_{1}\ z_{1}^{\prime})\lambda_{q}$ . Since $z_{i}\notin\{x_{i},\dots,x_{k-1},b_{1},\dots,b_{\ell-1}\}$ , likewise $z^{\prime}_{i}\notin\{x_{i},\dots,x_{k-1},b_{2},\dots,b_{\ell}\}$ . So, by Proposition E.6 we have $(x_{k-1}\ z^{\prime}_{k-1})\cdots(x_{1}\ z^{\prime}_{1})\lambda_{q}\in R_{A_{1};A_{2}}$ . Clearly, $(b_{\ell-1}\ c_{\ell-1})\cdots(b_{1}\ c_{1})\lambda_{q^{\prime}}\in R_{A_{2}}$ . So we have shown that $\tau\lambda_{q}\lambda_{q^{\prime}}$ can indeed be written as an element of $R_{A_{2}}R_{A_{1};A_{2}}$ . ∎

Appendix F Perfect matchings of $K_{n}^{(s)}$

Throughout this section, we fix $s\geq 2$ and let $n$ be a multiple of $s$ . We define $U=\mathcal{M}$ to be the set of perfect matchings of $K_{n}^{(s)}$ . Note that the case $s=2$ is the space of perfect matchings of $K_{n}$ , which has been studied more extensively, with a commutative resampling oracle given by Kolmogorov [29]. In [30], Lu, Székely & Mohr showed (non-algorithmically) that the LLLL holds for all $s\geq 2$ .

We will construct an oblivious resampling-space for the uniform distribution on $\mathcal{M}$ . This gives efficient sequential algorithms. We also show that when $s=2$ , the space is commutative and is compatible with our parallel algorithm.

The probability space $\Omega$ is the uniform distribution on $\mathcal{M}$ . For every size- $s$ subset $e$ of $[n]$ , we define the atomic event

$$\langle e\rangle=\{M\in\mathcal{M}\mid e\in M\}.$$

The dependency relation $\sim$ is defined by setting $\langle e\rangle\sim\langle e^{\prime}\rangle$ iff $e\neq e^{\prime}$ and $e\cap e^{\prime}\neq\emptyset$ .

The monoid $R$ is the symmetric group $S_{n}$ , with the natural group action on $U$ defined by

$$\sigma M=\bigl\{\{\sigma x_{1},\dots,\sigma x_{s}\}\mid\{x_{1},\dots,x_{s}\}\in M\bigr\}.$$

It is clear that properties (D0), (D2), (D3) hold.

Whenever we enumerate an edge $e=\{x_{1},x_{2},\dots,x_{s}\}$ , we always assume implicitly it is sorted so that $x_{1}<x_{2}<\dots<x_{s}$ . With this notation in mind, for an event $A=\langle\{x_{1},\dots,x_{s}\}\rangle$ we define the set of permutations

$$R_{A}=\bigl\{(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})\mid z_{i}\in[n]-\{x_{1},\dots,x_{i-1}\}\bigr\}$$

and we define $\Gamma_{A}$ to be the uniform distribution on $R_{A}$ . Note that each choice of $z_{2},\dots,z_{s}$ gives rise to a distinct permutation, so that $\Gamma_{A}$ also corresponds to the distribution obtained by choosing each index $z_{i}$ independently and uniformly from the set $[n]-\{x_{1},\dots,x_{i-1}\}$ .

Proposition F.1.

For any event $A=\langle e\rangle$ and any $N\in\mathcal{M}$ , there are precisely $(s-1)!$ ordered pairs $(\sigma,M)\in R_{A}\times A$ such that $\sigma M=N$ . In particular, for $s=2$ , property (C5) holds.

Proof.

Let $e=\{x_{1},\dots,x_{s}\}$ . Since $M$ is uniquely determined from $\sigma,N$ it suffices to show there are precisely $(s-1)!$ choices for $\sigma$ such that $\sigma^{-1}N\in A$ .

Consider $\sigma=(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})$ where $z_{i}\in[n]-\{x_{1},\dots,x_{i-1}\}$ . For each $j=1,\dots,s$ let us define $A_{j}$ to be the set of matchings $M$ such that $\{x_{1},\dots,x_{j}\}\subseteq e^{\prime}$ for some $e^{\prime}\in M$ . We claim that, given any matching $M\in A_{j}$ , there are precisely $s-j$ choices for $z_{j+1}\in[n]-\{x_{1},\dots,x_{j}\}$ such that $(x_{j+1}\ z_{j+1})M\in A_{j+1}$ . As $N\in A_{1}=\mathcal{M}$ and $A_{s}=A$ , this will establish that there are precisely $(s-1)\cdots 1=(s-1)!$ choices for $z_{2},\dots,z_{s}$ such that $(x_{s}\ z_{s})\cdots(x_{2}\ z_{2})N=\sigma^{-1}N$ is in $A$ .

Now suppose we have chosen values $z_{2},\dots,z_{j}$ , and so $N^{\prime}=(x_{j}\ z_{j})\dots(x_{2}\ z_{2})N$ has been determined. By hypothesis, $N^{\prime}\in A_{j}$ and so $N^{\prime}$ contains an edge $e=\{x_{1},\dots,x_{j},y_{1},\dots,y_{s-j}\}$ . We have $(x_{j+1}\ z_{j+1})N^{\prime}\in A_{j+1}$ iff $x_{j+1}$ is swapped into edge $e$ , which occurs precisely when $z_{j+1}\in\{y_{1},\dots,y_{s-j}\}$ . Thus, there are $s-j$ choices for $z_{j+1}$ as we have claimed. ∎

Proposition F.2.

Property (C1) holds.

Proof.

Consider event $A=\langle e\rangle$ . By Proposition F.1, there are precisely $(s-1)!$ pairs $\sigma\in R_{A},M\in A$ which lead to a given matching $N=\sigma M$ . Thus, when $\sigma\approx\Gamma_{A}$ and $M\approx\Omega|A$ , we have $\Pr(\sigma M=N)=(s-1)!\times\frac{1}{|R_{A}|}\times\frac{1}{|A|}$ . This does not depend upon $N$ , and so $\sigma M$ is uniformly distributed. ∎

Proposition F.3.

Property (C2) holds.

Proof.

Consider $A=\langle e\rangle$ where $e=\{x_{1},\dots,x_{s}\}$ and $A^{\prime}=\langle e^{\prime}\rangle$ with $A\not\sim A^{\prime}$ , and $M\in A-A^{\prime}$ . We cannot have $A=A^{\prime}$ since $A-A^{\prime}$ is non-empty, and so, as $A\not\sim A^{\prime}$ , the edges $e,e^{\prime}$ are disjoint.

Suppose for contradiction that $e^{\prime}\in\sigma M$ for $\sigma\in R_{A}$ . Let $i\geq 2$ be maximal such that $e^{\prime}\in(x_{i}\ z_{i})\cdots(x_{s}\ z_{s})M$ . We must have $i\leq s$ , since $e^{\prime}\notin M$ . It must be the case that $z_{i}\in e^{\prime}$ . Then the matching $N=(x_{i+1}\ z_{i+1})\cdots(x_{s}\ z_{s})M$ must contain the edge $(e^{\prime}-\{z_{i}\})\cup\{x_{i}\}$ . Thus, $x_{i}$ is matched to the vertices $e^{\prime}-\{z_{i}\}$ in $N$ . On the other hand, the entries $z_{i+1},\dots,z_{s}$ are all distinct from $x_{1},\dots,x_{i}$ ; therefore, in the matching $N$ , the entries $x_{1},\dots,x_{i}$ are not affected, and so $x_{1},\dots,x_{i}$ are matched to each other. Thus $x_{i}$ is matched in $N$ to $s-1$ vertices in $e^{\prime}$ as well as $i-1$ vertices in $e$ . Since $N$ contains only $s$ -edges, this is impossible. ∎

Proposition F.4.

Let $A=\langle e\rangle$ where $e=\{x_{1},\dots,x_{s}\}$ and $A^{\prime}=\langle e^{\prime}\rangle$ and $M\in A\cap A^{\prime}$ for $A\not\sim A^{\prime}$ . Consider $\sigma\in R_{A}$ of the form $\sigma=(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})$ where $z_{i}\in[n]-\{x_{1},\dots,x_{i-1}\}$ . Let $Z=\{z_{2},\dots,z_{s}\}$ .

1. If $A=A^{\prime}$ , then $\sigma M\in A^{\prime}\Leftrightarrow Z\subseteq e^{\prime}$ .

2. If $A\neq A^{\prime}$ , then $\sigma M\in A^{\prime}\Leftrightarrow Z\cap e^{\prime}=\emptyset$ .

Proof.

For case (1), suppose $Z\subseteq e^{\prime}=e$ . Then each $(x_{i}\ z_{i})$ permutes two elements within $e$ , and thus a simple induction on $i$ shows that $(x_{i}\ z_{i})\cdots(x_{s}\ z_{s})M=M$ for all $i=s,\dots,2$ . In particular $\sigma M=M$ . Conversely, suppose $Z\not\subseteq e$ and let $i$ be maximal such that $z_{i}\notin e$ . Then $(x_{i+1}\ z_{i+1})\cdots(x_{s}\ z_{s})M=M$ . This $z_{i}$ will remain matched to $x_{1}$ in $(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})M$ , and in particular $e\notin(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})M$ .

For case (2), we have $e\cap e^{\prime}=\emptyset$ since $A\not\sim A^{\prime}$ . If $Z\cap e^{\prime}=\emptyset$ , then edge $e^{\prime}$ is unaffected in $(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})M$ , and so $e^{\prime}\in\sigma M$ . Conversely, suppose $Z\cap e^{\prime}\neq\emptyset$ and let $i$ be maximal such that $z_{i}\in e^{\prime}$ . This $z_{i}$ remains matched to $x_{1}$ in $(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})M$ , and in particular the edge $e^{\prime}$ cannot remain in $(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})M$ . ∎

Proposition F.5.

Property (C4) holds.

Proof.

Proposition F.4 gives an explicit condition on when $\sigma M\in A^{\prime}$ for $A\not\sim A^{\prime},M\in A\cap A^{\prime},\sigma\in R_{A}$ . This condition depends solely on $A,A^{\prime},\sigma$ and not on $M$ . ∎
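Propositions F.1 and F.4 can both be checked by exhaustive enumeration on small hypergraphs. The Python sketch below assumes that $R_{A}$ consists of the products $(x_{2}\ z_{2})\cdots(x_{s}\ z_{s})$ with $z_{i}\in[n]-\{x_{1},\dots,x_{i-1}\}$ , as in the proofs above, and that $\sigma$ acts on a matching by relabeling the vertices of every edge. Vertices are 0-indexed and all function names are hypothetical.

```python
from collections import Counter
from itertools import combinations, product
from math import factorial


def perfect_matchings(vertices, s):
    """Yield every perfect matching of the complete s-uniform hypergraph."""
    vertices = sorted(vertices)
    if not vertices:
        yield frozenset()
        return
    first, rest = vertices[0], vertices[1:]
    for others in combinations(rest, s - 1):
        edge = frozenset((first,) + others)
        remaining = [v for v in rest if v not in others]
        for m in perfect_matchings(remaining, s):
            yield m | {edge}


def transposition(n, a, b):
    p = list(range(n))
    p[a], p[b] = p[b], p[a]
    return tuple(p)


def compose(f, g):
    """Return f∘g: apply g first, then f."""
    return tuple(f[g[i]] for i in range(len(g)))


def resampling_perms(n, e):
    """Yield (z, sigma) with sigma = (x_2 z_2)...(x_s z_s), z_i outside {x_1..x_{i-1}}."""
    xs = sorted(e)
    choices = [[z for z in range(n) if z not in xs[:i]] for i in range(1, len(xs))]
    for zs in product(*choices):
        sigma = tuple(range(n))
        for x, z in reversed(list(zip(xs[1:], zs))):  # (x_s z_s) acts first
            sigma = compose(transposition(n, x, z), sigma)
        yield zs, sigma


def act(sigma, matching):
    return frozenset(frozenset(sigma[v] for v in edge) for edge in matching)


def check_F1(n, s, e):
    """Every matching N should arise from exactly (s-1)! pairs (sigma, M), M in <e>."""
    all_m = list(perfect_matchings(range(n), s))
    A = [m for m in all_m if frozenset(e) in m]
    counts = Counter(act(sigma, m) for _, sigma in resampling_perms(n, e) for m in A)
    return all(counts[N] == factorial(s - 1) for N in all_m)


def check_F4(n, s, e, e2):
    """Membership of sigma*M in <e2> should follow the criterion on Z for every M.

    Requires e2 == e or e2 disjoint from e (that is, the two events are not dependent).
    """
    all_m = list(perfect_matchings(range(n), s))
    both = [m for m in all_m if frozenset(e) in m and frozenset(e2) in m]
    same = set(e) == set(e2)
    for zs, sigma in resampling_perms(n, e):
        predicted = set(zs) <= set(e) if same else not (set(zs) & set(e2))
        if any((frozenset(e2) in act(sigma, m)) != predicted for m in both):
            return False
    return True


print(check_F1(6, 3, (0, 1, 2)), check_F4(8, 2, (0, 1), (2, 3)))
```

For $n=8,s=2$ there are three matchings containing both edges, so the second check also exercises the point of (C4): the outcome depends on $Z$ but not on which of those matchings is the current state.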

Proposition F.6.

Property (D1’) holds.

Proof.

Consider $E=\{\langle e_{1}\rangle,\dots,\langle e_{k}\rangle\}$ and $A=\langle e\rangle$ where $e=\{x_{1},\dots,x_{s}\}$ . If $e_{1},\dots,e_{k}$ are distinct from $e$ , then we can sample $\sigma=(x_{2}\ z_{2})\dots(x_{s}\ z_{s})\approx\Gamma_{A;E}$ by selecting each $z_{i}$ independently from the set $[n]-(e_{1}\cup\dots\cup e_{k})-\{x_{1},\dots,x_{i-1}\}$ . Similarly, if one of the sets $e_{m}$ is equal to $e$ , then we select each $z_{i}$ independently from $e-\{x_{1},\dots,x_{i-1}\}$ . ∎

Proposition F.7.

For $s=2$ , property (C3’) holds.

Proof.

Consider $A_{1}=\langle(x_{1},y_{1})\rangle,A_{2}=\langle(x_{2},y_{2})\rangle$ and a matching $M\supseteq\{\{x_{1},y_{1}\},\{x_{2},y_{2}\}\}$ . We need to show that for any $z_{1}\in[n]-\{x_{1}\},z_{2}\in[n]-\{x_{1},y_{1},x_{2}\}$ there are $z^{\prime}_{2}\in[n]-\{x_{2}\}$ and $z^{\prime}_{1}\in[n]-\{x_{2},y_{2},x_{1}\}$ such that

[TABLE]

By relabeling, we assume without loss of generality that $x_{1}=1,y_{1}=3,x_{2}=2,y_{2}=4$ , and $z_{1},z_{2}\in\{1,\dots,6\}$ , and that either $M=\{\{1,3\},\{2,4\},\{5,6\}\}$ or $M=\{\{1,3\},\{2,4\},\{5,7\},\{6,8\}\}$ . We have exhaustively tested all choices $z_{1},z_{2}$ in both cases, verifying that there is always a choice of $z^{\prime}_{1},z^{\prime}_{2}\in\{1,\dots,8\}$ satisfying Eq. (4). ∎

Bibliography

[1] Achlioptas, D., Iliopoulos, F.: Random walks that find perfect objects and the Lovász Local Lemma. Journal of the ACM 63(3), Article #22 (2016)
[2] Achlioptas, D., Iliopoulos, F.: Focused stochastic local search and the Lovász local lemma. Proc. 27th ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 2024-2038 (2016)
[3] Achlioptas, D., Iliopoulos, F., Sinclair, A.: Beyond the Lovász Local Lemma: point to set correlations and their algorithmic applications. Proc. 60th IEEE Symposium on Foundations of Computer Science (FOCS), pp. 725-744 (2019)
[4] Albert, M., Frieze, A., Reed, B.: Multicoloured Hamilton Cycles. The Electronic Journal of Combinatorics 2(1), R 10 (1995)
[5] Bissacot, R., Fernandez, R., Procacci, A., Scoppola, B.: An improvement of the Lovász Local Lemma via cluster expansion. Combinatorics, Probability and Computing 20(5), pp. 709-719 (2011)
[6] Blelloch, G., Fineman, J., Shun, J.: Greedy sequential maximal independent set and matching are parallel on average. Proc. 24th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), pp. 308-317 (2012)
[7] Brandt, S., Fischer, O., Hirvonen, J., Keller, B., Lempiäinen, T., Rybicki, J., Suomela, J., Uitto, J.: A lower bound for the distributed Lovász Local Lemma. Proc. 48th ACM Symposium on Theory of Computing (STOC), pp. 479-488 (2015)
[8] Chung, K., Pettie, S., Su, H.: Distributed algorithms for the Lovász local lemma and graph coloring. Distributed Computing 30(4), pp. 261-280 (2017)
