Oblivious resampling oracles and parallel algorithms for the Lopsided Lovasz Local Lemma
David G. Harris

TL;DR
This paper introduces the concept of obliviousness in resampling oracles for the Lopsided Lovász Local Lemma, enabling faster parallel algorithms and new resampling oracles for complex combinatorial structures.
Contribution
It identifies the obliviousness property in resampling oracles, leading to a unified parallel LLLL algorithm and new resampling oracles for rainbow perfect matchings and Hamiltonian cycles.
Findings
Developed a faster parallel LLLL algorithm using obliviousness.
First RNC algorithms for rainbow perfect matchings and Hamiltonian cycles.
Constructed new sequential and commutative resampling oracles for complex structures.
Abstract
The Lov\'{a}sz Local Lemma (LLL) is a probabilistic tool which shows that, if a collection of "bad" events in a probability space are not too likely and not too interdependent, then there is a positive probability that no bad-events in occur. Moser & Tardos (2010) gave sequential and parallel algorithms which transformed most applications of the variable-assignment LLL into efficient algorithms. A framework of Harvey & Vondr\'{a}k (2015) based on "resampling oracles" extended this to general sequential algorithms for other probability spaces satisfying the Lopsided Lov\'{a}sz Local Lemma (LLLL). We describe a new structural property which holds for all known resampling oracles, which we call "obliviousness." Essentially, it means that the interaction between two bad-events depends only on the randomness used to resample , and not the precise state…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Machine Learning and Algorithms · Cryptography and Data Security
Oblivious resampling oracles and parallel algorithms for the Lopsided Lovász Local Lemma
David G. Harris Department of Computer Science, University of Maryland, College Park, MD 20742. Email: [email protected]
Abstract
The Lovász Local Lemma (LLL) shows that, for a collection of “bad” events in a probability space which are not too likely and not too interdependent, there is a positive probability that no events in occur. Moser & Tardos (2010) gave sequential and parallel algorithms which transformed most applications of the variable-assignment LLL into efficient algorithms. A framework of Harvey & Vondrák (2015) based on “resampling oracles” extended this to sequential algorithms for other probability spaces satisfying a generalization of the LLL known as the Lopsided Lovász Local Lemma (LLLL).
We describe a new structural property which holds for most known resampling oracles, which we call “obliviousness.” Essentially, it means that the interaction between two bad-events depends only on the randomness used to resample , and not the precise state within itself.
This property has two major consequences. First, combined with a framework of Kolmogorov (2016), it leads to a unified parallel LLLL algorithm, which is faster than previous, problem-specific algorithms of Harris (2016) for the variable-assignment LLLL and of Harris & Srinivasan (2014) for permutations. This gives the first RNC algorithms for rainbow perfect matchings and rainbow hamiltonian cycles of .
Second, this property allows us to build LLLL probability spaces from simpler “atomic” events. This gives the first resampling oracle for rainbow perfect matchings on the complete -uniform hypergraph , and the first commutative resampling oracle for hamiltonian cycles of .
This is an extended version of a paper which appeared in the ACM-SIAM Symposium on Discrete Algorithms (SODA) 2019.
1 The Lovász Local Lemma and its algorithms
The Lovász Local Lemma (LLL) is a fundamental probabilistic tool which shows that for a probability space with a finite set of “bad” events, then as long as the bad-events are not too interdependent (in a certain technical sense) and are not too likely, there is a positive probability no events in occur. The simplest form of the LLL, known as the symmetric LLL, can be stated as follows: if every bad-event has and is dependent with at most others, where , then there is a positive probability that none of the bad-events occur.
Most combinatorial applications of the LLL use a relatively simple probability space, which we call the variable-assignment LLL. This setting has independent variables , and each bad-event is a boolean function of a subset of these variables denoted . Bad-events are dependent (written ) iff . Moser & Tardos [35] introduced a remarkably simple algorithm for this setting, which we refer to as the MT algorithm:
Moser & Tardos [35] showed that this algorithm terminates quickly whenever the symmetric LLL criterion (or a more general asymmetric LLL criterion) is satisfied. Later work [36, 28, 18] showed that it terminates under more general criteria. See Appendix A for background on the LLL and MT algorithm.
Note that the MT algorithm requires a subroutine to find a bad-event which is true on the current configuration (if any). We refer to this as a Bad-Event Checker (BEC). The simplest implemention of this is to loop over all bad-events and test them one by one, which would have a run-time on the order of . The run-time of the MT algorithm can often be polynomial in and independent of if a more-efficient BEC is used [17, 21].
1.1 The Lopsided Lovász Local Lemma
In [10], Erdős & Spencer noted that positive correlation among bad-events (again, in a certain technical sense) is as good as independence for the LLL. This generalization has been referred to as the Lopsided Lovász Local Lemma (LLLL). We say are lopsidependent and write if are neither independent nor positively correlated in this sense. (Formal definitions are provided later in Section 2.)
Although the variable-assignment LLL covers the vast majority of applications in combinatorics, the LLLL is also used occasionally. For example, the original application of the LLLL used a probability space on permutations to construct Latin transversals for certain types of arrays [10]. Other applications include hamiltonian cycles on [4], perfect matchings of [32], perfect matchings of the complete -uniform hypergraph [30], and spanning trees of [30].
The variable-assignment setting provides one of the simplest forms of the LLLL. Here, as before, there are independent variables . Instead of allowing arbitrary boolean functions of the variables, each bad-event should be a monomial function, i.e. of the form
[TABLE]
For the LLL, we would have if the bad-events and share some common variable, i.e. . For the LLLL, the (lopsi)dependency relation is more restricted: we have if and disagree on some common variable, i.e. and .
Moser & Tardos showed that their algorithm applies to the variable-assignment LLLL setting. In [22], Harris & Srinivasan developed an algorithm similar to the MT algorithm for the probability space of random permutations, which includes the Latin transversal application of [10]. Extending these problem-specific algorithms, Harvey & Vondrák [25] developed a general framework based on a “resampling oracle” for the probability space. We will define this formally in Section 2, but, intuitively this is a randomized algorithm which, given some state with some bad-event true on , attempts to “rerandomize” the configuration in a “local” way to fix . This is similar to the way that the MT algorithm resamples the variables involved in . Given this resampling oracle, the following Algorithm 2 can be used to find a configuration avoiding the bad-events:
These results have led to constructive counterparts to combinatorial results involving spanning trees and matchings of (both discussed in [25]) and hamiltonian cycles of (subsequently developed in [24]). A further line of research has extended Algorithm 2, and variants, to other spaces which do not directly correspond to the LLLL [1, 2, 3, 23].
We note that the choice of which bad-event to select in line (3) of Algorithm 2 is much more constrained than for the MT algorithm. Only a limited number of possibilities work in general, such as selecting with smallest index, whereas the MT algorithm allows nearly complete freedom. In [29], Kolmogorov showed that a number of resampling oracles (including variable-assignment, permutations, and perfect matchings of ) satisfy an additional property known as commutativity. In such cases, Algorithm 2 also allows an arbitrary choice of which bad-event to select. Kolmogorov [29] and Iliopoulos [27] further showed that this property has powerful algorithmic consequences, including parallel algorithms, efficient BEC’s, and bounds on the output distribution at the termination of Algorithm 2.
1.2 Parallel algorithms
Moser & Tardos also presented a simple parallel version of their resampling algorithm. This parallel algorithm requires a slightly stronger criterion, which we refer to as -slack; for instance, the symmetric LLL requires ; if this satisfied, then it terminates after rounds with high probability.111We say that an event occurs with high probability (abbreviated whp), if it has probability at least . On a EREW PRAM, it has overall runtime . We summarize the algorithm as follows:
Haeupler & Harris [16] showed that the parallel MT algorithm could be implemented in time (avoiding dependence on ) and gave an alternative parallel algorithm in time . The parallel MT algorithm can also usually be implemented even for more general LLL criteria, including the asymmetric LLL and Shearer’s LLL criterion [28].
(In some computational models, multiple processors can write to a memory cell simultaneously and the runtimes can often reduced by logarithmic factors. For simplicity, we will be conservative and use only the EREW PRAM model throughout this paper. We say that an algorithm is in if it runs in time and processors whp on an EREW PRAM.)
The parallel MT algorithm leads in a straightforward way to distributed graph algorithms in communication rounds. There has been extensive research into obtaining faster distributed and parallel LLL algorithms; some of these algorithms require significantly stronger (but still local) conditions on the dependency and probability of the bad-events [8, 11, 13]. Brandt et al. [7] showed that generic distributed LLL algorithms require rounds.
Frustratingly, although the sequential MT algorithm works for the variable-assignment LLLL just as it does for the variable-assignment LLL, this is not true of the parallel MT algorithm. There have been only a handful of parallel algorithms for the LLLL, such as the variable-assignment LLLL algorithm of Harris [18] and the permutation LLL algorithm of Harris & Srinivasan [22].
In [29] Kolmogorov proposed a general framework for constructing parallel LLLL algorithms via resampling oracles, which can be summarized as follows:
Each iteration of the loop of lines (3) — (7) is called a round. Kolmogorov showed that, when the resampling oracle is commutative, then Algorithm 4 terminates whp after rounds. We emphasize this is a sequential algorithm, which is in fact a version of Algorithm 2.
If a single round can be simulated in polylogarithmic time, then this yields an RNC algorithm. In almost every setting where a parallel LLLL algorithm is known (including all the ones in this paper), the resampling oracle is commutative and the parallel algorithm is an implementation of Kolmogorov’s framework.
This makes partial progress to a general parallel LLLL algorithm; however, there remain two significant hurdles. The most straightforward of these is a parallel implementation of . This is trivial for the variable-assignment LLL: if bad-events are both selected for resampling, then and must be disjoint and the resamplings can be executed simultaneously. For other probability spaces, it is not clear how to resample without “locking” the state.
The second and much more fundamental hurdle is that the LLLL resampling process is inherently sequential in a way that the LLL is not. For the LLLL (but not the LLL) it is possible that two bad-events are currently true, and , and resampling makes false. We say in this case that fixes . Because of this possibility, and cannot be resampled simultaneously; one must select (arbitrarily) one of the two bad-events to resample first, and then only resample the second one if it still remains true. One critical challenge for LLLL algorithms is to simulate in parallel the process of resampling the bad-events in sequence.
The parallel LLLL algorithms of Harris [18] and Harris & Srinivasan [22] overcome these hurdles to a limited extent. However they still suffer from a number of shortcomings. Although they run in polylogarithmic time, the exponent is quite high (and is not computed explicitly). They also require additional structure, such as having bad-events which involve a polylogarithmic number of variables. Finally, and perhaps most seriously, these algorithms are highly tailored to a single probability space. They are reminiscent of the situation for LLL algorithms before the framework of Harvey & Vondrák [25]: specialized algorithms with ad-hoc analysis.
1.3 Our contribution and overview
We identify a new property of resampling oracles that we refer to as obliviousness. To summarize, suppose we have two bad-events with , and a state . The obliviousness property states that whether fixes depends solely on the randomness used to resample , and not on the state itself. This framework is developed in Section 2. We find it remarkable that so many LLLL probability spaces, even the non-commutative ones, have oblivious resampling oracles: this includes variable-assignment, permutations, perfect matchings of , perfect matchings of the hypergraph , hamiltonian cycles of , and spanning trees of .
A unified parallel algorithm. Obliviousness allows us to sidestep the second major hurdle to a parallel LLLL algorithm. It reduces the possibility of fixing to a pairwise phenomenon: we only need to know the resampling action chosen for , not the present state (which may be changing during other resampling actions). The space of sequential resamplings can thus be represented in a simple graph structure, allowing us to efficiently find a valid sequence.
To implement this sequence in parallel, we encode as a monoid action. Specifically, can be interpreted as a randomly-chosen monoid element acting on the current state . In this way, resampling multiple bad-events can be interpreted algebraically as the product . This is easily parallelized by the associativity of monoidal multiplication.
We summarize our generic parallel LLLL algorithm as follows:
Theorem 1.1** (Informal).**
Suppose that holds for any LLLL probability space with an appropriate parallelizable resampling oracle. Then there is a parallel algorithm in time to find a state avoiding .
We summarize some notable applications of this algorithm.
Suppose we have a -SAT instance on variables and clauses, in which each variable appears in at most clauses. There is an algorithm to find a satisfying assignment. 2. 2.
For an integer , suppose that is a -uniform hypergraph where each vertex appears in at most edges. There is an randomized algorithm in rounds for the LOCAL distributed computing model to find a proper vertex -coloring of . 3. 3.
Suppose that is an matrix whose entries are labeled by colors and each color appears in at most entries. For , there is an algorithm to find a Latin transversal of . For \Delta\leq n\Bigl{(}\frac{(s-1)!}{2e(1+\epsilon)s}\Bigr{)}^{1/(s-1)} there is an algorithm to find a transversal of where color appears at most times. 4. 4.
Suppose that we have an edge-coloring of where each color appears on at most edges. If and is even, there is a algorithm to find a rainbow perfect matching. If , there is an algorithm to find a rainbow hamiltonian cycle.
Versions of the first two results with slightly worse parameters can be derived from the variable-assignment LLL and parallel MT algorithm. Previous slower RNC algorithms are known for the third result. We are not aware of any RNC algorithms comparable with the fourth result; this answers open problems posed by Kolmogorov [29] and Harvey & Liaw [24].
A new resampling framework. Beyond its direct algorithmic impact, obliviousness can simplify a number of resampling oracle constructions. Most LLLL probability spaces come from a set of relatively simple “atomic events.” For example, in the space of uniform permutations, these are events of the form . A bad-event is then taken to be a conjunction of atomic events.
It is intuitively clear that the resampling oracle for the atomic events in some sense “generates” the resampling oracle for . A formal description of this has been elusive. To illustrate the difficulty, consider a bad-event and a configuration , where are atomic events. We would like to resample by resampling and then resampling . In order to obtain the correct probability distribution, we must condition on remaining true after resampling . For a general resampling oracle, this conditioning step might distort the probability distribution of in an unmanageable way. But for an oblivious resampling oracle, we are guaranteed that conditioning on remaining true retains an independent, uniform distribution for itself.
We derive a simple list of axioms required for an oblivious resampling oracle for the atomic events only; these automatically lead to a resampling oracle for . Beyond the fact that this gives new algorithmic results, this greatly simplifies many proofs and constructions for existing resampling oracles. We highlight a few results:
We get a commutative resampling oracle, and parallel algorithms, for the space of hamiltonian cycles of . 2. 2.
We get a resampling oracle for the space of perfect matchings of the complete hypergraph . This leads to efficient (sequential) algorithms corresponding to non-constructive results on rainbow hypergraph matchings shown by Lu, Mohr, & Székély [30].
1.4 Outline
In Section 2, we formally define the LLLL in terms of resampling oracles. We provide a new framework which is more algebraic compared to the probabilistic formulation originally developed in [25]. We define the properties needed for resampling oracles, including commutativity and the new property of obliviousness. We also discuss the method for generating LLLL-compatible probability spaces from atomic events.
In Section 3, we describe a new graph algorithm needed for our parallel LLLL algorithm. This computes a structure which is similar to a lexicographically-first MIS (LFMIS), but generalized to directed graphs. This plays a similar role to the MIS in the parallel MT algorithm, but respects the sequential ordering of the bad-events. We show that, for a random vertex order, this LFMIS can be computed efficiently in rounds by a simple greedy parallel algorithm adapted from Blelloch, Fineman & Shun [6] for undirected graphs. This is a pure graph theory problem which does not directly involve the LLLL, and may be of independent interest.
In Section 4, we describe our generic LLLL algorithm in terms of a resampling oracle from the framework of Section 2.
In Section 5, we analyze the variable-assignment LLLL. We show how the simple resampling oracle (which is just to resample variables from the original distribution) fits into the formal framework of Section 2. We provide a few example applications, to -SAT and hypergraph coloring.
In Section 6, we describe a few other more “exotic” LLLL spaces, including random permutations, hamiltonian cycles, and perfect matchings. We discuss a few applications, including to strong coloring and a number of Latin transversal problems.
1.5 Notation
Throughout, we let denote the set . For a probability space over a ground set , we say that if is a random variable drawn according to distribution . We define to be the probability mass of , and we define to be the set of values with .
For any we define . We also define to be the conditional distribution on , i.e. for .
For two random variables , we say if follow the same distribution. For any set , we define to be the uniform distribution on .
For , we let denote the complete -uniform hypergraph on vertex set . For (the complete graph), we also write . We say that is a perfect matching of if it is a partition of into exactly classes of size . Whenever we refer to the set of perfect matchings of , we will assume implicitly that divides .
We define to be the symmetric group on letters, viewed concretely as the set of bijections on ground set . We write for the transposition swapping and . We also write for the functional composition , that is, the function sending to .
For subsets of an algebraic structure , we let denote the product set . Similarly, for we write and .
For a directed graph and a vertex , we define the out-neighborhood and the out-degree of is the cardinality of this set. Similarly we define the in-neighborhood , and the in-degree of is the cardinality of this set.
2 The LLLL and resampling oracles
In this section, we will formally define the LLLL and how to construct a resampling oracle for it, in the sense of Harvey & Vondrák [25]. We note that Erdős & Spencer [10] describes an alternate, probabilistic interpretation of the LLLL, which is slightly more general. Since this is technical to describe and we will never use this interpretation, we will not discuss this here.
Constructions based on the LLLL typically have two phases. First, we choose a large collection of highly-structured “generic” bad-events in a probability space, equipped with an appropriate lopsidependency relation and a resampling oracle. For example, in the variable-assignment LLLL setting, the underlying probability space is a cartesian product space with independent variables and the generic bad-events are the monomial functions of the form for arbitrary values . For the permutation setting, the underlying probability space is the uniform distribution on and the generic bad-events have the form for arbitrary values .
It is impossible to avoid all the generic bad-events. The second phase of the LLLL is to select some problem-specific, more-or-less “random”, subset of the generic bad-events. For example, if we wish to satisfy a given -SAT formula, then for each clause , we would have in the bad-event , which is one of the generic bad-events.
In order to show that the LLLL applies, and that Algorithm 2 converges to an assignment avoiding , we must show two things: first, that the resampling oracle works properly on the generic set of bad-events containing . Second, that the specific chosen subset has its probabilities and dependencies sufficiently small; for example, each bad-event has and is lopsidependent with at most other bad-events of such that .
These two phases are almost completely distinct. The first is highly algebraic, while the second is more combinatorial. In this section, we will only discuss the first phase of constructing the generic set of bad-events to be compatible with the LLLL. The second phase, for which we use only standard techniques, is discussed in Appendix A.
2.1 Framework for resampling oracles
Consider a probability space over a ground set , along with a collection of events in that space. There is also a binary symmetric relation provided for , which we refer to as the dependency relation.222More properly, this should be referred to as a “lopsidependency” relation. The distinction between dependency and lopsidependency is not important for us so we use the simpler terminology. We will define the properties needed for a resampling oracle for this space, in the sense of Algorithm 2, along with the new property “obliviousness” which we will need for our algorithms. We will later construct a number of such resampling oracles.
We will define by specifying a monoid which acts on . We refer to the -act on as the resampling action, and we write it as for . We also define, for each , a probability distribution over and we define . The intent is to define the resampling oracle as where . Note that it is very important for us to separate the role of the randomness used in .
Before we define our new obliviousness property, let us reiterate the conditions of Harvey & Vondrák [25] and Kolmogorov [29], in terms of our notation.333Kolmogorov [29] refers to property (C3) here as “strong commutativity.” We will never use the weaker commutativity properties defined by Kolmogorov, so we just refer to this as commutativity for convenience.
- (C1)
(Probability regeneration) For any and any fixed , we have
[TABLE] 2. (C2)
(Locality) If , and , then for all we have . 3. (C3)
(Commutativity) Let . For any states and , there is an injective mapping from states with , to states with , such that
[TABLE]
Observation 2.1**.**
If Properties (C1) and (C2) are satisfied, then the randomized function defined by choosing and outputting , gives a resampling oracle in the sense of Harvey & Vondrák [25]. If (C3) is also satisfied, then the resampling oracle is commutative in the sense of Kolmogorov [29].
We define a resampling-space to be an ensemble of such objects satisfying (C1) and (C2). We sometimes refer to the overall ensemble also just as . We define the neighborhood of by and we also define .
Observe that if is a resampling-space and , then is also a resampling-space (where is the restriction to ). Furthermore, if (C3) holds for then it holds for as well. We emphasize that these properties alone do not imply that that Algorithm 2 will converge when using the resampling oracle . Our usual strategy is to show that some generic set is a resampling-space with desired properties, and then take to be an arbitrary subset of . We then show that one of the LLLL convergence criteria, such as Shearer’s criterion, is satisfied on . See Appendix A for further details and definitions.
Bearing this in mind, we can summarize the main result of [25] as follows:
Theorem 2.2** ([25]).**
If is a resampling-space which satisfies Shearer’s criterion, then Algorithm 2 terminates in expected polynomial time.
We are now ready to introduce the new structural property:
- (C4)
(Obliviousness) For all pairs in with , and all , one of the following two conditions holds:
- (a)
For all we have 2. (b)
For all we have
We refer to this as obliviousness since whether is in does not depend upon the state . In light of (C4), let us define set . We also define the conditional probability distribution , and for any set we define and .
The definition of commutativity as it appears in (C3) is cumbersome to work with and lacks good compositional properties. To make it easier to show (C3), we use an additional property of resampling oracles identified by Achlioptas & Iliopoulos [1], which we refer to as injectivity.444In [1], this property is referred to as atomicity. We use the alternate terminology injectivity to avoid confusion with our discussion of atomic bad-events. We state one variant of this property as follows:
- (C5)
(Injectivity) For all and , there is exactly one with .
Our main motivation for this property is that it greatly simplifies condition (C3), allowing us to use an alternate condition (C3’) instead:
- (C3’)
For all pairs and all we have .
We summarize this in the following result:
Proposition 2.3**.**
If properties (C3’), (C4), (C5) hold, then property (C3) holds.
Proof.
We begin with a preliminary calculation: consider any . By (C1) we have . By (C5), we have only if , and so . Combining these equations, we get the following formula:
[TABLE]
Let us now show (C3). Fix . By (C5), at most one state has . If there is no such , then there is nothing to show. Otherwise, by (C3’) there must exist with . We map to this . Since there is only one possible value , the mapping is trivially injective. We need to show that this pair satisfies
[TABLE]
By Eq. (1), we have
[TABLE]
A symmetric argument shows that is also equal to this quantity. ∎
2.2 Atomically-generated probability spaces
Most known resampling-spaces have a nicer form: the bad-events are conjunctions of a limited class of “atomic” events. For example, for the variable-assignment LLLL, an atomic event is ; for the space of uniform permutations, an atomic event is . The obliviousness property allows us to formalize this: we can define a resampling oracle and a simple list of axioms for the atomic events alone, and then we automatically get a resampling oracle for conjunctions of atomic events. This vastly simplifies the constructions for a number of diverse LLLL spaces.
Let be an oblivious resampling-space. We say that a set is stable if for all distinct pairs , and we define . For , we also write as shorthand for .
Let us define to be the set of conjunctions of events of ,
[TABLE]
We will use the same ground set and monoid for . The new dependency relation for is defined by setting if there exist with .
The key to the construction is to extend the distributions for the atomic events to a probability distribution for an event in . To do so, we select some arbitrary fixed ordering as , and we then define to be the distribution over products , wherein are independent random variables and is drawn from distribution . (For , is the identity element of .)
Theorem 2.4**.**
If is an oblivious resampling-space, then so is . If, in addition, satisfies (C5) and (C3’), then so does ; in particular, is commutative.
The proof of Theorem 2.4 is technical, so we defer it to Appendix C. In later sections, we use it for a number of new and simpler constructions of resampling-spaces. Notably, these include hamiltonian cycles of and perfect matchings of . Our construction for hamiltonian cycles of is commutative, in contrast to a previous resampling oracle construction of Harvey & Liaw [24]. No resampling oracle of any kind was known for perfect matchings of for any .
2.3 Efficient resampling oracles
Our framework for resampling oracles, in which is derived from a monoid , may seem overly restrictive. In fact, it is without loss of generality: for an arbitrary resampling oracle in the sense of Harvey & Vondrák [25], we could simply take to be the full transformation monoid. This would be useless computationally, because writing down an element of would require exponential time.
In order to get an efficient parallel algorithm we must be able to efficiently compute on . We summarize the requirements in terms of four properties (D0)—(D3); the runtime bounds are chosen so that the resampling action does not become the computational bottleneck for the overall algorithm described later. Here the parameter measures the input length to the algorithm.
- (D0)
We can sample from in time and processors. 2. (D1)
For any , we can sample from in time and processors. 3. (D2)
For and , we can compute in time and processors. 4. (D3)
For , we can compute in time and processors.
For atomically-generated probability spaces, these properties can themselves be simplified:
Proposition 2.5**.**
Suppose that , such that every bad-event is given by for some stable set with . Suppose that satisfies property (D3) as well as the the following property (D1’):
- (D1’)
For any and stable set with and , we can sample from in time and processors.
Then satisfies property (D1).
Proof.
Let for some stable set with . To draw , we first use (D1’) to sample independent variables wherein each drawn from . We then use (D3) to compute in time. ∎
We say that a resampling space is amenable if it satisfies the following computational conditions:
- •
It satisfies properties (C3)–(C4).
- •
The monoid satisfies properties (D0)–(D3).
- •
It has has a BEC running in time and processors.
We will later describe a parallel algorithm for such spaces. Note that, even without these properties, the resampling-space may still be be useful for a sequential algorithm or a combinatorial existence proof. Also, note that the third condition is satisfied if and we can efficiently check each bad-event in time.
2.4 Cartesian products
Another useful method for constructing resampling-spaces comes from a cartesian product construction. Consider resampling-spaces for . We define a new resampling-space as follows. The underlying space is and is the corresponding product distribution. The monoid is the cartesian product , with the natural monoid act on . The events in are those of the form , where . For such an event , we define to be the probability distribution on tuples , wherein are independent, and is drawn from in resampling-space . The relation on is defined by if there is an index where .
The following is immediate from the definitions:
Observation 2.6**.**
If are oblivious resampling-spaces, then so is .
If in addition are commutative, then so is .
If in addition and satisfy properties (D0)–(D3), then so does .
As an example, the permutation LLL as defined in [22] allows selection of permutations , wherein each is drawn independently and uniformly from some , and a bad-event has the form . This can be modeled as the cartesian product of the uniform distributions on . Therefore, the resampling action defined by the uniform distribution on immediately gives a corresponding resampling action for the permutation LLL.
3 LFMIS for directed graphs
Before we describe the parallel LLLL algorithm, we need an important graph-theoretic subroutine: the LFMIS for directed graphs. This plays a similar role for our LLLL algorithm as the MIS does for the parallel MT algorithm. By itself, the LFMIS has little connection to the LLLL, and may be of independent combinatorial and algorithmic interest.
For an undirected graph , an independent set of is a vertex set where no two vertices in are adjacent in . A maximal independent set (MIS) has the additional property that no is an independent set of . There is a trivial sequential algorithm to find an MIS of by adding vertices one-by-one to . The MIS produced by this sequential algorithm is referred to as the lexicographically first MIS (LFMIS).
With a slight abuse of terminology, we can extend the definition of LFMIS to a directed graph . Formally, we define the LFMIS of with respect to a permutation to be the vertex set produced by the following sequential process:
An undirected graph can be viewed as a directed graph , where every edge corresponds to two directed edges . The LFMIS (in the usual sense) of is then identical to the directed LFMIS of .
The LFMIS problem for undirected graphs is P-complete in general [9]. However, Blelloch, Fineman, Shun [6] described a simple parallel greedy algorithm to find the LFMIS of an undirected graph, when is chosen uniformly at random. The algorithm can also be used for directed graphs. We summarize it as follows, where we define for a vertex to be the set of vertices with .
This can be viewed as a parallel algorithm, where each iteration of identifying the residual source nodes and adding them to , can be implemented in time and processors. Alternatively, it can be viewed as a distributed algorithm, where each iteration requires distributed communication rounds on . We get the following main result to analyze Algorithm 6.
Theorem 3.1**.**
Algorithm 6 produces the LFMIS of with respect to . When is chosen uniformly at random, then Algorithm 6 terminates in rounds whp. In particular, Algorithm 6 runs in time on an EREW PRAM whp.
The analysis is very similar to the proof given in [6], which showed that the (undirected) degrees are rapidly reduced when is an undirected graph. We defer the full proof of Theorem 3.1 to Appendix D, which shows a slightly stronger result. Note that Fischer & Noever [12] later showed that Algorithm 6 terminates in rounds whp for undirected graphs. We conjecture that it should be possible to improve our analysis and show that Algorithm 6 runs in rounds whp on directed graphs as well.
4 A generic parallel resampling algorithm
We are now ready to describe our parallel algorithm for an amenable resampling-space. We recall that throughout, the parameter represents the description size of a configuration, such that a state is encoded in bits. Correspondingly, our goal for an RNC algorithm is to achieve runtime, processors, and success probability .
Clearly, if Algorithm 7 terminates, then all the bad-events in are false on . For maximum generality, we analyze Algorithm 7 in terms of two parameters from the Shearer LLLL criterion; see Appendix A for a precise definition. Theorem A.2 gives a few simpler LLL criteria, including the symmetric, asymmetric, and cluster-expansion criteria. For most applications, and . Our main result will be the following:
Theorem 4.1**.**
Let be an amenable resampling-space. If the Shearer criterion is satisfied with parameters , then Algorithm 7 runs in time and processors whp.
For most applications, we can use a simplified corollary:
Corollary 4.2**.**
Let be an amenable resampling-space which satisfies the symmetric LLL criterion . Then Algorithm 7 runs in time and processors whp.
Some probability spaces have convergence and distributional properties which go beyond the generic bounds such as Shearer’s criterion [18, 19, 27]. Since Algorithm 7 can be viewed as a simulation of the sequential algorithm, all such bounds apply equally to it. We will see some examples in the next section with analysis of the variable-assignment LLLL.
We now turn to proving Theorem 4.1. We assume throughout that is amenable. We refer to each iteration of the main loop of Algorithm 7 (lines (3) – (8)) as a round. We use , etc to denote the quantities corresponding to round , and also define . We first observe that a single round can be implemented efficiently.
Proposition 4.3**.**
Each round of Algorithm 7 can be implemented using processors and time whp.
Proof.
Since is amenable, we can determine the set using our BEC in time.
By (D1), we can draw the random variables in time . In light of (C4), we can efficiently check if , by computing and testing if .
By Theorem 3.1, we can find in time and processors whp.
To implement step (7), we use use the associativity of monoid multiplication to compute the product in rounds of pairwise multiplications. By (D3), each round takes time. Noting that , this gives a total of time and processors. Once this product is computed, we can use (D2) to compute . ∎
Thus, our main task is to show that Algorithm 7 terminates after a small number of rounds. We do so by coupling it to a sequential resampling algorithm, Algorithm 8.
By the principle of deferred decisions, there is no difference in selecting the random variable in a “preprocessed” way (as in line (4) of Algorithm 8), as opposed to in “online” way as in Algorithm 2. Thus, line (8) of Algorithm 8 is equivalent to executing the resampling oracle and so Algorithm 8 can be viewed as a version of Kolmogorov’s algorithm (Algorithm 4).
For Algorithm 8, define to be the chosen ordering of , i.e. the map sending to in . Also define to be the set of events resampled in round , i.e. the events such that at iteration of line (7). The following result shows the equivalence between Algorithm 8 and Algorithm 7:
Proposition 4.4**.**
If the random variables are all fixed at the beginning of round and are the LFMIS produced for Algorithms 7 and 8 respectively for round , then .
Proof.
Let denote the state after iteration of round (and is the state at the beginning of round ). We have enumerated as where , and we write as shorthand for .
With this notation, observe that iff there is no with and either (a) or (b) . Similarly, iff there is no with and either (a) or (b) is false on . For contradiction, say that is minimal such that the membership of differs in and .
Suppose that . Since , there must be some with such that or . In the former case, by our induction hypothesis and this would contradict that . In the latter case, note that since , it must be that is true on and and is true on . Thus, and . So , a contradiction.
Next, suppose that . Since , there must be some with such that or is false on . Let be minimal subject to these conditions. In the former case, by induction hypothesis ; in the latter case, by minimality of , it must be that becomes false after resampling , and so . In either case, we have . So and , implying that . Thus has an edge , contradicting that . ∎
They key property we need to analyze Algorithm 8 is the following:
Lemma 4.5**.**
If for , then .
Proof.
In the execution of Algorithm 8, let denote the total number of resamplings before round (so ), and note that is the state immediately at the beginning of round .
By definition, must be true on . Either is true at time or ; otherwise, by property (C4), would remain false after all the resamplings in round .
If or we are done. Otherwise, suppose . This can only be the case if was marked as dead in round . Suppose this occurs at time , during the resampling of some . If , we are done.
Otherwise, suppose that is false on . Since is true at the beginning of round , by (C4) there must be some resampled between times and with , i.e. . ∎
Lemma 4.5 in combination with analysis of Kolmogorov [29] shows that Algorithm 7 terminates in a small (polylogarithmic) number of rounds. There is also a “random-like” distribution of the states during intermediate stages of the parallel LLLL algorithm. In all, we get the following bound:
Lemma 4.6**.**
Whp, Algorithm 7 terminates after rounds and .
The proof of Lemma 4.6 requires significant background and a number of preliminary definitions, so we defer it to Appendix A.
Now let denote the total number of rounds in Algorithm 7. Proposition 4.3 shows that each round uses time and processors whp. Property (D0) allows us to implement step (1) in time. Thus the overall runtime of Algorithm 7 is at most . By concavity, we have . By Lemma 4.6, we have whp. Thus, the time complexity here is as most and the processor count is at most .
This shows Theorem 4.1. Corollary 4.2 follows directly, noting that .
5 The variable-assignment LLLL
The variable-assignment LLLL is one of the most important LLLL probability spaces. Let us set notation and discuss how this fits into our resampling framework. We also discuss a few unique properties of the variable-assignment LLLL as well as some applications.
To begin the construction, we first consider the simplest setting, where the probability space is defined by single variable over a universe . The generic bad-event set has the tautological event , as well as an event for each . We define by setting for . (The event is not dependent with any others.)
We form using a construction called the find-last monoid. Formally, we define , where is an identity element. The binary operation on is defined as
[TABLE]
Note that , with for , and so naturally gives a left -act on .
For event , we define to be the value with probability one. For an event , we define to be the distribution . One can easily verify that the resulting resampling oracle is defined by , where is drawn from the distribution , i.e. we resample the variable. It is trivial to verify that this resampling-space satisfies (C3’), (C4), (C5), and (D0)–(D3).
We can get the full variable-assignment LLLL via the cartesian product construction. Namely, the probability space is over for some discrete set , and each bad-event has the form , wherein is either or an event . Equivalently, can be written as . For such an event, we define as follows: For , the entries are all independent, wherein for and otherwise. The resulting oracle is to simply resample the variables . By Observation 2.6, this resampling-space is again amenable.
This is a very notationally heavy way of describing a very simple probability space and a very simple resampling action. However, it illustrates how our resampling framework gives a non-trivial resampling-space (the full variable-assignment LLLL) by composing a few trivial building-blocks.
5.1 Alternate LLLL criterion
In [18], Harris described an alternative convergence criterion for the MT algorithm called orderability. This is defined in terms of a function ; the full formal definitions are technical and are deferred to Appendix B. As our parallel algorithm for the variable-assignment LLLL can be viewed as an implementation of the MT algorithm, the orderability criterion can also be used to analyze Algorithm 7. This gives the following result:
Theorem 5.1**.**
Let satisfy the orderability variable-assignment criterion with -slack, and let . If has a BEC using time and processors, then Algorithm 7 runs in time and processors whp.
As a example application, we get the following result:
Proposition 5.2**.**
Suppose we have a -SAT instance in variables, where each variable appears in at most clauses. Then there is a parallel algorithm to find a satisfying assignment in time using processors whp.
Proof.
As shown in [18, Theorem 4.1], the orderability criterion can be satisfied with slack satisfied under these conditions using the weighting function for all . Furthermore, and we can implement a BEC by checking every clause. ∎
5.2 Distributed algorithms
The LOCAL model is a popular model for distributed graph algorithms. Here, in each round, a node in a graph can perform arbitrary computations and has unlimited communication with its neighbors. Distributed LLL algorithms can solve a number of graph problems in this setting, where each vertex has a set of associated bad-events local to , and bad-events in and are dependent iff the distance from to is bounded by some (problem-specific) constant.
As a simple example, consider finding a proper vertex-coloring. For each vertex , we have some bad-events that chooses the same color as a neighbor . Observe now that and are dependent iff there is some common vertex , i.e. . See [8] for a thorough discussion of this model of computation and applications to a number of graph-coloring problems.
Our parallel algorithm can be easily transformed into a distributed LLLL algorithm:
Proposition 5.3**.**
Suppose that the orderability variable-assignment criterion is satisfied with parameters . Then there is a distributed LOCAL algorithm to find a variable assignment avoiding in rounds whp. In particular, if , then this runs in rounds.
Proof.
All of the steps in a round of Algorithm 7, except the computation of the LFMIS at line (6) and the state update at line (8) can be implemented in communication rounds. The state update can be done in rounds and the greedy LFMIS can be implemented in rounds whp; note that Algorithm 7 only creates an edge between if overlap on a variable and so we can simulate the directed graph created in line (5). As shown in Appendix B we have whp. ∎
One application, which is an immediate consequence of LLLL analysis of [18], is to proper vertex coloring of a hypergraph:
Proposition 5.4**.**
Let be a -uniform hypergraph in which each vertex appears in at most edges. Then there is a randomized LOCAL algorithm in rounds to construct a non-monochromatic -coloring of for .
6 Other resampling-spaces
We now discuss how our resampling framework applies to a few other resampling-spaces, with some applications. The main space discussed here is the uniform distribution on . Two others are the uniform distribution on hamiltonian cycles of , and the uniform distribution on perfect matchings of the complete hypergraph for . The latter two involve very technical algebraic arguments, so we defer the full proofs to Appendices E and F.
6.1 Uniform distribution on
In this setting, we have , and we use instead of to represent the system state. The atomic sets have the form
[TABLE]
for some ; we write this as . We define on by setting if one of the following two conditions holds: (i) and or (ii) and . Equivalently, we have iff .
We define to be the symmetric group . For any , we define to be the set of single-swap permutations of the form for , and is the uniform distribution on . We define the resampling action as left-multiplication in the obvious way.
Proposition 6.1**.**
Properties (D0), (D2) and (D3) hold.
Proof.
The monoid operation and monoid act are both composition of permutations, which can easily be done in time. Property (D0) holds using any of the standard ways to generate uniform random permutations. ∎
Proposition 6.2**.**
Properties (C5) and (C1) hold.
Proof.
Consider and . We claim that there is precisely one pair with such that . For, we have iff iff . Furthermore, once is determined, is also uniquely determined.
This shows (C5). Also, when and , it implies that with probability precisely . Thus is uniformly distributed on , showing (C1). ∎
Proposition 6.3**.**
Property (C2) holds.
Proof.
Consider and and . Clearly so . Suppose for contradiction that . So . If , then , which contradicts . If , then , which is impossible as . ∎
Proposition 6.4**.**
Let and with . Let and . Then:
If , then ; 2. 2.
If , then
Proof.
In case (1), if , then , which is in by hypothesis. If , then , and so .
In case (2), since we have . If , then and so . If , then , and so . ∎
Proposition 6.5**.**
Property (C4) holds.
Proof.
Proposition 6.4 gives an explicit condition for when holds for . This condition depends solely on and not on itself; thus, for any fixed , it holds for all such or none of them. ∎
Proposition 6.6**.**
Property (C3’) holds.
Proof.
Let and where . We need to show for any fixed and indices , there exist such that
[TABLE]
If this is trivial. Also, if are distinct from each other and , then we can simply take . Otherwise, there are a number of cases depending on which of the terms are equal to each other.
Case I: . Let . If , then , and so setting works. Otherwise, if , then . So setting works. Our hypothesis ensures that .
Case II: . Then , so take .
Case III: . We may assume that , as we have already covered these cases. Then , so taking works. Note that , as otherwise we would have .
Case IV: . Then , so take . Note that we cannot have as this would imply . ∎
6.2 Applications
We illustrate with the classic applications of the permutation LLL to Latin transversals. Suppose we have an matrix , whose entries come from some set of colors. An -bounded transversal of this matrix is a permutation , such that no color appears at least times among the entries . The case is known as a Latin transversal, and in this case the permutation is said to be rainbow in that no color is repeated among the entries of .
Proposition 6.7**.**
Suppose that each color appears at most times in . Then, we can find a Latin transversal in time and processors for . We can find an -bounded transversal in time and processors for \Delta\leq n\Bigl{(}\frac{(s-1)!}{2e(1+\epsilon)s}\Bigr{)}^{1/(s-1)}.
Proof.
We use the probability space of the uniform distribution over . For the first result, observe that the cluster-expansion LLL criterion is satisfied with slack of and .
For the second result, for each tuple with , we have a separate bad-event , that . Each has probability , and has at most neighboring bad-events . Thus, in order to satisfy the symmetric LLL criterion with -slack, we need
[TABLE]
To show this, we calculate:
[TABLE]
So holds under the stated hypothesis. One can easily construct a BEC in time: for each color class, simply enumerate all of the current entries of with that color. ∎
We note that the runtime in Proposition 6.7 does not depend on . By contrast, the permutation LLL algorithm of [22] would only give a parallel algorithm for constant . There are two main reasons it has poor scaling as a function of : first, the number of bad-events could be , which is super-polynomial for unbounded ; second, each bad-event spans entries, whereas [22] only allows bad-events to use polylogarithmic entries. We also note that a sequential algorithm of [22] based on partial resampling can achieve better bounds for large , but our parallelization strategy does not extend to that case.
We next illustrate with some applications to finding rainbow subgraphs of and :
Proposition 6.8**.**
Consider an edge-coloring of where every color appears on at most edges. If and is even, then we can find a rainbow perfect matching in time and processors whp. If , then we can find a rainbow hamiltonian cycle in time and processors whp.
Proof.
We encode these problems via the probability spaces of the uniform distribution of perfect matchings of and hamiltonian cycles of , respectively. In Apppendices E and F we show that the spaces both have amenable resampling oracles. It is shown in [29] and [24], respectively, that that cluster-expansion LLL criterion is satisfied with slack and . ∎
Proposition 6.9**.**
Consider an edge-coloring of where every color appears on at most edges. If , then there is a poly-time algorithm to find a rainbow perfect matching.
Proof.
The probability space is defined by selecting matching uniformly at random. For each pair of edges of the same color, we have a bad-event that are both in . This event has probability
[TABLE]
In Appendix F, we show that has a resampling-space, albeit not a commutative one. To apply the cluster-expansion criterion, we use a slightly denser dependency graph: two events are dependent if the corresponding edges overlap. To enumerate the stable sets of neighbors of with respect to this dependency graph, for each of the vertices involved in we may select another edge and another edge of the same color as (a total of choices).
We set for every bad-event for some parameter . In order to satisfy the cluster-expansion criterion, we will then need
[TABLE]
Simple calculus shows that when the hypotheses are satisfied, then Eq. (2) can be satisfied for some . Using the resampling oracle in Appendix F, we can implement Algorithm 2 in polynomial time to produce a configuration avoiding . ∎
As another application, consider strong coloring: given a graph with a partition of the vertices into blocks each of size (i.e., ), we would like to find a proper -coloring such that every block has exactly colors. In [26], Haxell showed that such a coloring exists when and is sufficiently large, for some constant ; this is the best bound currently known. Furthermore, the constant cannot be improved to any number strictly less than . In [22], a variety of LLL-based algorithms are given for constructing the colorings, with worse bounds on and with large (unspecified) runtimes. Our LLLL algorithms gives a crisp result, which is perhaps the first parallel algorithm with reasonable bounds on both and the run-time:
Proposition 6.10**.**
Given a partition of into blocks of size , a coloring of can be found in time whp.
Proof.
Consider the probability space of uniform distribution over permutations , wherein each is a permutation of the vertices in block . For each edge with , and each value , we have a bad-event . Harris & Srinivasan [22] show that this satisfies the LLLL cluster-expansion criterion with -slack when . Furthermore, the probability space is the cartesian product of copies of the uniform distribution on . By Observation 2.6, this has an amenable resampling-space. ∎
We note that, subsequent to the original version of this paper, a variety of works have appeared with better bounds on the colors and the runtime for strong coloring [14, 15, 20]. Most recently, [20] provides a deterministic sequential poly-time algorithm for and a deterministic parallel algorithm with runtime for , for any constant .
Finally, we consider a hypergraph packing problem of Lu & Székély [31].
Proposition 6.11**.**
Let be two -uniform hypergraphs on vertices, where each has edges such that .
There is an algorithm in processors and time to find an injective map such that is edge-disjoint to . (That is, there are not edges with .)
Proof.
Let us briefly review a construction of [31]. We use the LLL to construct the permutation . For each pair of edges , and each permutation , we form a bad-event that . The stated hypothesis ensures that these events satisfy the symmetric LLL criterion. Furthermore, there is a simple BEC here which can be implemented in time: for each , we sort and check if it in . ∎
Note that Harris & Srinivasan [22] only gives an RNC algorithm if the hypergraphs have rank ; this condition is not required for Proposition 6.11.
7 Acknowledgments
Thanks to Chen Meiri for explanations about group actions. Thanks to anonymous conference and journal reviewers for helpful suggestions and corrections.
Appendix A Background on the LLLL
Consider some resampling-space with a lopsidependency relation . The simplest criterion for the LLL or the LLLL on is the symmetric criterion , where is the maximum probability of any bad-event and is the maximum dependency of any bad-event. A number of other criteria such as the asymmetric criterion can also be stated in terms of the probabilities and dependency-structure of the bad-events; the most general of these is Shearer’s criterion [37]. Parallel algorithms usually need a slightly stronger criterion which we refer to as -slack: the vector of probabilities must satisfy Shearer’s criterion for .
We will describe the Shearer criterion in terms of stable-set sequences, which is a more useful tool for analyzing the MT algorithms. The connection between stable-set sequences and the original form of Shearer’s criterion was developed by Kolipaka & Szegedy [28].
We say that a set is stable if there are not distinct elements with . For a stable set , we define .
We define a stable-set sequence to be a sequence , where each is a non-empty stable set of and for . We say that is singleton and rooted at if . We define the depth of to be , the size of to be and the weight of to be . We define to be the set of all singleton stable-set sequences.
Theorem A.1** ([28]).**
If Shearer’s criterion is satisfied with -slack, then .
In light of Theorem A.1, we define the key parameter . This allow us to state the most general bounds. However, Shearer’s criterion is difficult to work with in practice, so a number of simpler LLL criteria are often used instead.
Theorem A.2**.**
1. (Asymmetric LLL criterion) Suppose that some function satisfies
[TABLE]
Then Shearer’s criterion is satisfied with -slack and .
2. (Cluster-expansion criterion [5]) Suppose that some function satisfies
[TABLE]
Then Shearer’s criterion is satisfied with -slack and .
3. (Symmetric LLL criterion) Suppose that and for every , and . Then Shearer’s criterion is satisfied with -slack and .
For each bad-event during Algorithm 8, we define a corresponding sequence by setting and then, for , setting .
Proposition A.3**.**
For , the sequence is a stable-set sequence of depth rooted at .
Proof.
Clearly has depth and , and also clearly . Since is stable, so is each . Finally, to show that is non-empty, consider some ; note that Lemma 4.5 ensures that there is some ; this will appear in . ∎
We say that a given depth- stable-set sequence rooted at appears if . Iliopoulos [27] showed a connection between appearing stable-sequences and probabilities of bad-events in Algorithm 2 for a commutative resampling oracle. These bounds also apply to Algorithm 8 since it is a version of Algorithm 2. We summarize the key result as follows:
Proposition A.4** ([27]).**
For a commutative resampling oracle, any stable-set sequence appears with probability at most .
Using our bounds on stable-set sequences and arguments from [16], we now prove Lemma 4.6:
Proof of Lemma 4.6.
Each corresponds to an appearing depth- stable-set sequence . All such stable-set sequences are distinct: if , then the depths of and are distinct, while if then the roots of and are distinct.
Thus, is at most the number of appearing stable-set sequences. Proposition A.4 shows that . So by Markov’s inequality, whp.
If Algorithm 8 runs for rounds, then for each , there is at least one appearing depth- stable set sequence (namely for an arbitrary ). Thus, a necessary condition for Algorithm 8 to run for rounds is that at least distinct singleton stable-set sequences of size at least appear. By Proposition A.4, the expected number of such sequences is given by
[TABLE]
By Markov’s inequality, the probability that the actual number exceeds is at most . This is below for some . ∎
Appendix B Alternative variable-assignment LLLL criterion
We summarize here an alternate criterion of Harris for the variable-assignment LLLL [18].
Given a bad-event of the variable-assignment LLLL and a set , we say that is orderable to if either , or there is an ordering and an ordering such that, for each , the bad-event demands and none of the events for do so. We also say that a map satisfies the orderability criterion with -slack for if it satisfies
[TABLE]
The main result of [18] is the following:
Theorem B.1**.**
Suppose that the map satisfies the orderability criterion with -slack for . Then the expected number of resampling executed by the MT algorithm is at most .
To show this, [18] defined a type of witness tree, which differs slightly from the witness trees in the original analysis of Moser & Tardos and from the stable-set sequences discussed in Appendix A. Let us summarize very briefly. Suppose we have run the sequential MT algorithm up to some time , resampling bad-events , and that some event is currently true. To generate the witness tree , we start with a root node labeled . For each , we try to add a node to the tree with label , placing it as a child of some node labeled with . If there are multiple eligible positions we place the node at greatest depth (breaking ties arbitrarily).
However, one additional condition is enforced: for any node with label , the children of must have distinct labels such that is orderable to . A node is not eligible to have a child node labeled , if adding such node would violate this condition.
We say that a labeled tree appears if for any event and time . We define to be the set of all possible labeled trees that could appear. The key lemma of [18] is the following:
Lemma B.2**.**
Any labeled tree appears with probability at most . Furthermore, we have where we define .
Algorithm 7 can be viewed as a simulation of the sequential MT algorithm, so this same lemma applies to it. By using arguments of [18] for a similar parallel resampling algorithm, we can see that if a bad-event is true after total resamplings in the middle of round of of Algorithm 8, then witness tree has depth and is rooted at . This allows us to show a result analogous to Lemma 4.6 in terms of the orderability criterion, and thereby to show Theorem 5.1. Since the proof is nearly identical to Lemma 4.6 and Theorem 4.1, we omit it here.
Appendix C Proof of Theorem 2.4
We suppose here we have a resampling-space satisfying conditions (C1), (C2), (C4). At later stages in the proof we may also assume it satisfies conditions (C3’) and (C5).
It will be convenient to work with ordered sequences from . We say that is a stable list if for . For a permutation , we define . Likewise, we define to be the set of products wherein . Whenever we discuss resampling an event and we write , then we tacitly assume that we have chosen to order the elements of as , so that for the stable list .
Proposition C.1**.**
* satisfies (C1).*
Proof.
Consider . Let be independent variables, wherein is drawn from . We need to show that when , then .
For each let us define and . Since each is chosen from , we see that with probability one for all . We will show that that by induction on . The base case is given to us by hypothesis (since ), and the case is what we are trying to prove.
Consider a state and . For any , property (C1) gives . If , then we claim that ; for, if for some , then by property (C2) as well. Similarly, if , then ; for if , then by property (C4) we would have . Thus, for , we have
[TABLE]
By induction hypothesis, and both have the distribution . Likewise, and both have the distribution . Furthermore, the variables are independent and the variables are independent. This implies that
[TABLE]
So . This shows that is proportional to for any . Since with probability one, this implies that . ∎
Proposition C.2**.**
* satisfies (C2).*
Proof.
Consider and with , and let . Consider . There must exist some such that . We can show that that for all , by an induction on : the base case holds since , and the induction step follows from property (C2) applied to event and .
At , this shows that . ∎
Proposition C.3**.**
Let and be events in where . For any state and , the following are equivalent:
** 2. 2.
There exist such that and for all
Proof.
For (2) (1), a simple induction on shows that for .
For (1) (2), the definition of shows where each is in . If for we are done; otherwise, let be minimal such that for some . So . Since , by repeated applications of (C2), we see also that is also not in and hence not in . ∎
Corollary C.4**.**
* satisfies (C4).*
Proof.
For events with , Proposition C.3 gives an explicit condition on to ensure that for . This condition depends solely on , and not itself. ∎
Proposition C.5**.**
If satisfies (C5), then satisfies (C5).
Proof.
Consider for . For each we define . For , we claim that there exists exactly one state such that . The base case holds vacuously with , and the case is what we are trying to show.
For the induction step, we first show existence. By (C5), there exists such that . So for some . By induction hypothesis, we have for . Since , it must be the case that and for each such . Thus, and .
Next, we show uniqueness. Suppose that for some . Since and , by (C5) this implies that . ∎
Proposition C.6**.**
Suppose that satisfies (C3’). Then for a stable list , any , and any , we have .
Proof.
Since we can generate any permutation by swapping adjacent elements, it suffices to show this holds when for some .
Let wherein each . Define . Note that . By (C3’) applied to events , there exist with . Since , it must be the case that and .
Now set . We thus have shown that . Furthermore, we have . ∎
Proposition C.7**.**
If satisfies (C3’), then satisfies (C3’)
Proof.
Consider events and and any . By symmetry, it suffices to show that for any there are with .
Define . By definition of , we have where for . By Proposition C.3, we have where for . Thus, we see that .
Now define and note that is a rearrangement of the list . By Proposition C.6, this implies that there exists such that . We can write , wherein for , and for . If we set and , then and by Proposition C.3 we have . We then have as desired. ∎
Appendix D Proof of Theorem 3.1
Consider a directed graph , with a permutation chosen uniformly at random. Let denote the directed acyclic graph on vertex set and edge-set . Let denote the LFMIS of with respect to . For any integer , define the partial LFMIS . For integers , define the residual vertex set and define to be the induced subgraph .
For the purpose of analysis, it will be useful to consider a slowed-down variant of Algorithm 6 called SLOW-GREEDY, as discussed in [6]. Given integers , it is defined as follows:
We refer to the iteration of the loop in line (2) as epoch . We make the following observations for Algorithm 9; since the proofs are completely analogous to the undirected case, we refer to the reader to [6] for full proof details.
Proposition D.1** ([6]).**
For any integers with , we have the following:
SLOW-GREEDY computes the LFMIS of with respect to . 2. 2.
The number of rounds in Algorithm 6 on and is at most the total number of rounds in SLOW-GREEDY. 3. 3.
If all directed paths in have length at most , then epoch of SLOW-GREEDY terminates in at most rounds.
Algorithm 6 can be viewed as a special case of SLOW-GREEDY with ; in particular, this shows that Algorithm 6 correctly computes the LFMIS of with respect to .
We now analyze the path lengths in the subgraphs . For , let us define
[TABLE]
Proposition D.2**.**
With probability at least , we have for any .
Proof.
Let us fix some vertex , and we want to show that either or for . For each define to be the event that is alive and has at least alive in-neighbors after step of Algorithm 5.
We compute the probability of conditional on . As are determined by , it suffices to compute the probability of conditional on . This allows us to determine the set of alive in-neighbors of after step . If , then is false. Otherwise, we have with probability at least , in which case is removed from after iteration and is false. Thus, . This implies that
[TABLE]
By definition contains only vertices which are alive after iteration . Thus, if is false, the desired property holds for and . To finish, taking a union bound over all values of . ∎
Proposition D.3**.**
Suppose that we condition on , and let , and let denote the length of the longest path in . Then, with probability at least , it holds that
[TABLE]
Proof.
Consider the induced graph , which depends only on the values . Let be the maximum in-degree of . We can enumerate the length paths of by choosing the final vertex in the path ( choices), and each of the previous vertices in the path ( choices each), so the number of length -paths in is at most .
A necessary condition for a path to survive to is that . Having conditioned on , this event has probability
[TABLE]
Taking a union-bound over all such paths, we have
[TABLE]
If , then note that for this is at most . If , then set and ; we then have (es/k)^{k}=\exp\Bigl{(}\frac{-10\log n}{\log x}\times\log\bigl{(}\frac{10\log n}{es\log x}\bigr{)}\Bigr{)}=\exp\Bigl{(}\frac{-10\log n}{\log x}\times\log\bigl{(}\frac{5x}{e\log x}\bigr{)}\Bigr{)}. As , standard analysis shows that \log\bigl{(}\frac{5x}{e\log x}\bigr{)}\geq 0.5\log x for . Thus, this is at most \exp\bigl{(}\frac{-10\log n}{\log x}\times 0.5\log x\bigr{)}=e^{-5\log n}=n^{-5}. ∎
We are now ready to bound the runtime. We show a slightly tighter bound in terms of the maximum in-degree of graph .
Theorem D.4**.**
Let . When is chosen uniformly at random, then:
For , Algorithm 6 takes O\Bigl{(}\frac{\log n}{\log\tfrac{2\log n}{d}}\Bigr{)} rounds whp. 2. 2.
For , Algorithm 6 takes rounds whp.
In particular, Algorithm 6 takes rounds whp.
Proof.
1. By Proposition D.3 applied at , whp the graph has maximum path length where . By Proposition D.1, this implies that Algorithm 6 terminates in rounds whp.
2. We will use Proposition D.1 with parameters and for and . Note that as required, since .
Define for . For , we have . For , Proposition D.2 shows that with probability at least , in which case . When these events occur, then by Proposition D.3, each graph for has maximum path length with probability at least .
By Proposition D.1, these facts imply that, whp, each epoch of SLOW-GREEDY takes rounds. Overall, the total number of rounds over all epochs is . ∎
Appendix E Hamiltonian cycles of
In order to use algebraic tools, we encode a hamiltonian cycle of as the permutation . In this way, the ground set can be viewed as the set of permutations consisting of precisely one cycle of length . We define to be the group with the natural group action of left-multiplication on ; thus properties (D0), (D2), (D3) are trivial.
For any sequence of distinct values , let us define the set of permutations
[TABLE]
Note that each choice for the values for give rise to a distinct permutation. Thus, .
We are now ready to define the resampling-space itself. Let be the set of paths where are distinct elements of . We define the support of the path by . For such path , define an atomic event
[TABLE]
We define the dependency relation by setting if .
For a given set , let us define to be the set of permutations in whose cycle structure consists of fixed points at each , along with a single cycle on . Note that . There is an important permutation which “normalizes” the path , namely
[TABLE]
For , we define to be to the uniform distribution on . The following observations explain the role of :
Observation E.1**.**
For and path , we have iff .
Proposition E.2**.**
Let . For and , we have .
Proof.
Let where , and for . We show by induction on that . The base case at is precisely Observation E.1 since , and the case at is what we are trying to show since and .
For the induction step, we have . The point does not appear in the cycle of by induction hypothesis. However, since , the point does so. Thus has inserted just before in its cycle, moving from a fixed point to part of its cycle. ∎
We now show that the necessary properties are satisfied.
Proposition E.3**.**
Properties (C5) and (C1) hold.
Proof.
Consider for a path and let . We claim that there is precisely one choice for the ordered pair with and such that .
Since is uniquely determined from , we will show that there is precisely one choice for such that . By Observation E.1, this is equivalent to showing .
Consider where . We want to show that there is a unique choice for indices such that is in .
It suffices to show that for any index and , there is a unique choice for such that . Since , the element appears in the full cycle, followed by some . Now note that has an additional fixed point at precisely if . Thus there is precisely one choice of with .
This shows the claim and immediately gives (C5). For (C1), note that for any , the probability of , where is drawn uniformly from and is drawn uniformly from , is precisely . ∎
Proposition E.4**.**
Property (C2) holds.
Proof.
Consider for and for with and . There must exist some index with .
Let . We claim that so that .
To show this, define for , wherein . Suppose that is minimal such that . It cannot be , as (since ).
For this value , it must be either that (a) or (b) . The former cannot occur as and the latter cannot occur as . ∎
Proposition E.5**.**
Let and . Let where . Then iff are all distinct from .
Proof.
The reverse direction is immediate. For the forward direction, define for and let be minimal such that . We show by induction that for we have . For the base case, we have . For the induction step, suppose that . If we have as desired. If , then , again as desired.
Thus, if some of the are equal to then , and in particular . ∎
Proposition E.6**.**
Property (C4) holds. Furthermore, for with , we have
[TABLE]
Proof.
Let . Consider . For , we have ; by Proposition E.5 this is equal to iff are distinct from . Thus, iff are distinct from . To show (C4), note that this criterion does not depend on , so it either holds for all or none of them. ∎
Given any event and stable set , this result allows us to efficiently draw from , by selecting indices wherein each is distinct from the tail for each in . In particular, this shows (D1’).
We will now show commutativity. This follows from the observation that depends only on the unordered set :
Proposition E.7**.**
For any distinct values and any permutation , we have
[TABLE]
Proof.
It suffices to consider for . Consider where . We will show that there exist such that with . In this case, replacing the terms with allows us to swap , showing that . There are a few cases.
If all four values are distinct, then and so works. 2. 2.
If , then . Thus taking and works. 3. 3.
If , then . Thus taking works. ∎
Proposition E.8**.**
Property (C3’) holds.
Proof.
Let where with . We will show that
[TABLE]
where we define . Note that and commute since , and by Proposition E.7 the set does not depend upon the ordering of the list , and so by symmetry this will then show that as desired.
Since , the values are distinct from . We have and . Using the explicit description of from Proposition E.6, we calculate . Thus . We will show that ; a counting argument then shows Eq. (3).
Consider of the form
[TABLE]
where and .
If , then . Otherwise, for , we have . This shows that , where is defined as
[TABLE]
So we have shown that . Since , likewise . So, by Proposition E.6 we have . Clearly, . So we have shown that can indeed be written as an element of . ∎
Appendix F Perfect matchings of
Let us fix throughout this section and a multiple of and we define to be the set of perfect matchings of . Note that the case is the space of perfect matchings of , which has been studied more extensively, with a commutative resampling oracle given by Kolmogorov [29]. In [30], Lu, Székély & Mohr showed (non-algorithmically) that the LLLL held for all .
We will construct an oblivious resampling-space for the uniform distribution on . This gives efficient sequential algorithms. We also show that when , the space is commutative and is compatible with our parallel algorithm.
The probability space is the uniform distribution on . For every size- subset of , we define the atomic event
[TABLE]
The dependency relation is defined by setting iff and .
The monoid is the symmetric group , with the natural group action on defined by
[TABLE]
It is clear that properties (D0), (D2), (D3) hold.
Whenever we enumerate an edge , we always assume implicitly it is sorted so that . With this notation in mind, for an event we define the set of permutations
[TABLE]
and we define to be the uniform distribution on . Note that each choice of gives rise to a distinct permutation, so that also corresponds to the distribution obtained by choosing each index independently and uniformly from the range the .
Proposition F.1**.**
For any event and any , there are precisely ordered pairs such that . In particular, for , property (C5) holds.
Proof.
Let . Since is uniquely determined from it suffices to show there are precisely choices for such that .
Consider where . For each let us define to be the set of matchings such that for some . We claim that, given any matching , there are precisely choices for such that . As and , this will establish that there are precisely choices for such that is in .
Now suppose we have chosen values , and so has been determined. By hypothesis, and so contains an edge . We have iff is swapped into edge , which occurs precisely when . Thus, there are choices for as we have claimed. ∎
Proposition F.2**.**
Property (C1) holds.
Proof.
Consider event . By Proposition F.1, there are precisely pairs which lead to a given matching . Thus, when and , we have . This does not depend upon , and so is uniformly distributed. ∎
Proposition F.3**.**
Property (C2) holds.
Proof.
Consider where and and . We cannot have since is non-empty, and so are disjoint.
Suppose for contradiction that for . Let be maximal such that . We must have , since . It must be the case that . Then matching must contain an edge . Thus, is matched to the vertices in . On the other hand, the entries are all distinct from ; therefore, in the matching , the entries are not affected, and so are matched to each other. Thus is matched in to vertices in as well as vertices in . Since contains only -edges, this is impossible. ∎
Proposition F.4**.**
Let where and and for . Consider of the form where . Let .
If , then 2. 2.
If , then .
Proof.
For case (1), suppose . So each permutes two elements within , and thus a simple induction on shows that for all . In particular . On the other hand, let be maximal such that . Then . This will remain matched to in , and in particular .
For case (2), we have since . If , then edge is unaffected in , and so . On the other hand, let be maximal such that . This remains matched to in , and in particular the edge cannot remain in . ∎
Proposition F.5**.**
Property (C4) holds.
Proof.
Proposition F.4 gives an explicit condition on when for . This condition depends solely on and not on . ∎
Proposition F.6**.**
Property (D1’) holds.
Proof.
Consider and where . If are distinct from , then we can sample by selecting each independently from the set . Similarly, if one of the sets is equal to , then we select independently from . ∎
Proposition F.7**.**
For , property (C3’) holds.
Proof.
Consider and a matching . We need to show that for any there are and such that
[TABLE]
By relabeling, we assume without loss of generality that , and , and that either or . We have exhaustively tested all choices in both cases, verifying that there is always a choice of satisfying Eq. (4). ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Achlioptas, D., Iliopoulos, F.: Random walks that find perfect objects and the Lovász Local Lemma. Journal of the ACM 63(3), Article #22 (2016)
- 2[2] Achlioptas, D., Iliopoulos, F.: Focused stochastic local search and the Lovász local lemma. Proc. 27th ACM-SIAM Symposium on Discrete Algorithms (SODA), pp. 20248-2038 (2016)
- 3[3] Achlioptas, D., Iliopoulos, F., Sinclair, A.: Beyond the Lovász Local Lemma: point to set correlations and their algorithmic applications. Proc. 60th IEEE Symposium on Foundations of Computer Science (FOCS), pp. 725-744 (2019)
- 4[4] Albert, M., Frieze, A., Reed, B.: Multicoloured Hamilton Cycles. The Electronic Journal of Combinatorics 2(1), R 10 (1995)
- 5[5] Bissacot, R., Fernandez, R., Procacci, A., Scoppola, B.: An improvement of the Lovász Local Lemma via cluster expansion. Combinatorics, Probability and Computing 20(5), pp. 709-719 (2011)
- 6[6] Blelloch, G., Fineman, J., Shun, J.: Greedy sequential maximal independent set and matching are parallel on average. Proc. 24th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA), pp. 308-317 (2012)
- 7[7] Brandt, S., Fischer, O., Hirvonen, J., Keller, B., Lempiäinen, T., Rybicki, J., Suomela, J., Uitto, J.: A lower bound for the distributed Lovász Local Lemma. Proc. 48th ACM Symposium on Theory of Computing (STOC), pp. 479-488 (2015)
- 8[8] Chung, K., Pettie, S., Su, H.: Distributed algorithms for the Lovász local lemma and graph coloring. Distributed Computing 30(4), pp. 261-2680 (2017)
