TL;DR
This paper introduces a generalized, automatic speedup technique for distributed problems, transforming algorithms into faster ones across different problems, and applies it to prove a long-standing lower bound for weak 2-coloring.
Contribution
It extends the speedup technique to any locally checkable problem, enabling problem transformations and proving new lower bounds in distributed computing.
Findings
Established a generalized speedup method for distributed problems.
Solved the open problem of lower bounds for weak 2-coloring in odd-degree graphs.
Provided a sequence of problems with decreasing deterministic time complexities.
Abstract
Recently, Brandt et al. [STOC'16] proved a lower bound for the distributed Lov\'asz Local Lemma, which has been conjectured to be tight for sufficiently relaxed LLL criteria by Chang and Pettie [FOCS'17]. At the heart of their result lies a speedup technique that, for graphs of girth at least , transforms any -round algorithm for one specific LLL problem into a -round algorithm for the same problem. We substantially improve on this technique by showing that such a speedup exists for any locally checkable problem , with the difference that the problem the inferred -round algorithm solves is not (necessarily) the same problem as . Our speedup is automatic in the sense that there is a fixed procedure that transforms a description for into a description for and reversible in the sense that any -round algorithm for can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
- An Automatic Speedup Theorem for Distributed Problems
Sebastian Brandt [email protected] ETH Zurich
- Abstract. Recently, Brandt et al. [STOC’16] proved a lower bound for the distributed Lovász Local Lemma, which has been conjectured to be tight for sufficiently relaxed LLL criteria by Chang and Pettie [FOCS’17]. At the heart of their result lies a speedup technique that, for graphs of girth at least , transforms any -round algorithm for one specific LLL problem into a -round algorithm for the same problem. We substantially improve on this technique by showing that such a speedup exists for any locally checkable problem , with the difference that the problem the inferred -round algorithm solves is not (necessarily) the same problem as . Our speedup is automatic in the sense that there is a fixed procedure that transforms a description for into a description for and reversible in the sense that any -round algorithm for can be transformed into a -round algorithm for . In particular, for any locally checkable problem with exact deterministic time complexity on graphs with nodes, maximum node degree , and girth at least , there is a sequence of problems with time complexities , that can be inferred from .
As a first application of our generalized speedup, we solve a long-standing open problem of Naor and Stockmeyer [STOC’93]: we show that weak -coloring in odd-degree graphs cannot be solved in rounds, thereby providing a matching lower bound to their upper bound.
1 Introduction
In this work, we study the question of determining the time complexity of distributed graph problems from the perspective of round elimination. More concretely, we ask: Given a problem , can we find another problem which can be solved in exactly one round less? Is it perhaps even possible to infer such a problem from in an automated fashion, so that we can obtain a sequence of problems with decreasing complexities until we end up with a problem that can be solved in [math] rounds? We will show that, given certain (reasonable) conditions, the answer to both questions is yes. As a concrete evidence of the power of this automatic speedup, we resolve the complexity of odd-degree weak -coloring, an open question asked by Naor and Stockmeyer in 1993 [24].
Model
Our distributed model of computation is a variant of the well-known model [22, 25], a synchronous message passing model where the nodes of a given input graph are processors that have the task to collaboratively solve some graph problem on . The essential difference between our model and the model is that in our case nodes are not equipped with unique identifiers, but (potentially) have some other symmetry breaking information available. The class of problems we consider are locally checkable problems, i.e., problems where the global validity of a solution can be checked locally by the nodes in constant time. More precisely, for some constant , each node has a set of acceptable output configurations for its radius- neighborhood, and a global solution is considered valid if and only if the output configuration of each node’s radius- neighborhood is acceptable. Our speedup results are about the deterministic time complexity of locally checkable problems; however, there exist known techniques to lift the obtained bounds to both the randomized and the deterministic model. We will elaborate on these techniques in Section 2.2.
1.1 Our Contributions
Our main contributions are twofold:
- We present a speedup theorem that generalizes the speedup technique of Brandt et al. [9]: In that work, the authors obtain a lower bound for the distributed Lovász Local Lemma by developing a round elimination technique for a problem called sinkless orientation. We show that such a speedup exists for arbitrary locally checkable problems. In particular, we provide a method that takes the description of a problem as input and outputs the description of a problem that can be solved one round faster but not any faster than that. This facilitates exciting new approaches for obtaining time complexity lower (and upper) bounds.
- We show that our speedup technique is a powerful tool by using it to prove a tight lower bound for weak -coloring in odd-degree graphs, answering the long-standing open question by Naor and Stockmeyer.
Speedup Results
We provide an automatic procedure that transforms any given locally checkable problem into a (locally checkable) problem such that, informally speaking, the following holds.
Theorem 1** (informal).**
Let be a locally checkable problem and the problem obtained by applying our speedup transformation. On graphs of girth at least , the following two statements are equivalent:
- (1)
There is an algorithm solving in time .
- (2)
There is an algorithm solving in time .
If has an exact deterministic time complexity on graphs with nodes, maximum node degree , and girth at least , then applying this speedup iteratively yields a sequence of problems with time complexities . Now determining the time complexity of just a single in the sequence will automatically determine the time complexities of all other , and, most importantly, of our initial problem . We will give a detailed explanation how to apply this technique to infer bounds for the complexity of a problem in Section 2.1, but let us first consider a concrete application. We remark that, apart from the following application, our speedup theorem also semi-automatically reproduces previously known techniques, such as the sinkless orientation speedup [9] and color reduction on rings (leading to the upper bound for -coloring a ring [14, 18]), as we will see in Sections 4.4 and 4.5. Moreover, building on our speedup technique, Balliu et al. [2] very recently proved new lower bounds for maximal matching and maximal independent set, in both the deterministic and the randomized setting.
Odd-Degree Weak -Coloring
Weak -coloring is the problem of coloring the nodes of a given input graph with colors such that each node (of degree at least ) has at least one neighbor with a different color. In their seminal work [24], Naor and Stockmeyer proved that in graphs where each node has odd degree, a weak -coloring can be found in time . As one of their three open questions they asked whether this bound can be improved. While the question seemed simple enough, surprisingly no progress has been made over the past 25 years, indicating that the available lower bound techniques might not be sufficient for showing that the upper bound is tight. Using our speedup results, we close this gap by showing the following theorem.
Theorem 4**.**
There is no -time algorithm solving weak -coloring in odd-degree graphs.
Given the speedup framework, the main technical ingredient in our proof is a generalization of weak -coloring to a problem we call superweak -coloring that has the following two nice properties:
If we set to be superweak -coloring, then the problem obtained by applying our speedup is at least as hard as superweak -coloring, for some . 2. 2.
Relaxing to superweak -coloring (and then continuing to apply the speedup technique) is sufficiently tight, in the sense that we essentially still need speedup steps until we obtain a [math]-round solvable problem.
More concretely, a rough proof outline goes as follows. Relax weak -coloring to superweak -coloring, apply our speedup, relax the obtained problem to superweak -coloring for some , apply our speedup, relax to superweak -coloring for some , etc. Then, show that any problem obtained after steps of speedup and relaxation is still not solvable in [math] rounds. Our speedup results then immediately imply that there is no -algorithm for weak -coloring.
1.2 Related Work
Follow-up Work
In a recent breakthrough, Balliu et al. [2] used our speedup technique to show that, both for maximal matching and maximal independent set, there is no randomized algorithm with runtime and no deterministic algorithm with runtime . As documented in [2, Section 3.7], apart from the speedup provided by Theorem 1, the authors also apply both of our simplification techniques (see Section 2.1) to achieve their lower bounds.
Further Related Work
The first occurrence of the round elimination speedup technique we extend to any locally checkable problem was seen in [9], where the authors show that such a speedup works for the problem of (-regular) sinkless orientation, resulting in a randomized lower bound of that also applies to the (constructive symmetric) distributed Lovász Local Lemma (LLL) and -coloring. In [12], Chang et al. proved that this result can be extended to a deterministic lower bound of , which is tight (for sinkless orientation) due to a matching upper bound by Ghaffari and Su [17], who also gave a matching upper bound for the deterministic case. Chang and Pettie [13] conjectured that the (randomized) lower bound for the distributed LLL is tight for sufficiently relaxed LLL criteria; despite a recent improvement of the upper bound by Ghaffari et al. [16], this conjecture is still open. In [11], Chang et al. simplified the randomized speedup technique of [9] and showed that the sinkless orientation lower bounds also imply an deterministic and an randomized lower bound for -edge coloring.
Weak -coloring on odd-degree graphs was introduced by Naor and Stockmeyer [24] as an example of a non-trivial problem that can be solved in constant time on graphs of bounded degree. As they show, the odd-degree condition is a necessary requirement; there are graph classes where nodes are allowed to have even degrees for which no constant-time weak coloring algorithm exists. Very recently, Balliu et al. [5] refined our knowledge in this regard by proving a tight lower bound of for weak -coloring on regular trees. While, from a theory perspective, weak coloring is interesting as “a problem with minimal symmetry breaking requirements” [20, p.139], a more concrete application exists in the form of certain resource allocation problems [24]. Naor and Stockmeyer provided an -algorithm for odd-degree weak -coloring, which was subsequently simplified and adapted to the dynamic setting by Mayer et al. [23]. While progress has been made for other relaxations of the standard node coloring problem, such as defective [20, 6] or arbdefective [7] coloring, the question by Naor and Stockmeyer whether their bound can be improved has remained open until now.
Naor and Stockmeyer’s work also initiated, together with [22], the line of research on local algorithms, and introduced the concept of locally checkable labeling (LCL) problems. This class of problems has been subject to many investigations, resulting in an almost complete understanding of the respective complexity landscape very recently [3, 4, 9, 10, 12, 13, 15, 16]. We remark that, while our speedup applies to LCL problems, it is not restricted to them: in particular, we do not require the considered graphs to be of constant degree.
2 Technical Overview
In this section, we will outline how to apply the speedup technique to obtain new bounds, including further helpful techniques for the application of the speedup. Moreover, we will give an overview of the available techniques to extend bounds achieved from our speedup (in our model) to the LOCAL model and to randomized complexities.
2.1 How to Apply the Speedup Results
The most natural application of our speedup technique is to prove a lower bound for some given problem . The roadmap is as follows. Starting with , we apply our speedup theorem iteratively, resulting in a problem sequence , where each problem can be solved exactly one round faster than the previous one. Given the mentioned conditions, our speedup works until we reach a problem that can be solved in [math] rounds; hence, in theory, the only thing we have to do is to look at our sequence and to determine which is the first problem in this sequence that is solvable in [math] rounds (usually depending on our parameters and/or ). If is the first problem solvable in [math] rounds, the problem we are interested in has time complexity . However, since Theorem 1 requires girth at least , only the lower bound of holds for general graphs.
While [math]-round solvable problems have a simple characterization, there is a catch: in general, the description of an inferred problem is much more complex than the description of the original problem. In fact, dealing with this explosion in complexity is one of the main challenges in applying our speedup. To this end, we provide two simplification techniques.
Relaxation
After inferring a new problem from via the speedup, we can try to find a relaxed version of (i.e., a problem that is provably not harder than ) that has a much simpler description, and use this problem as the starting point for the next speedup step. Alternating between relaxation and speedup, we continue this process until we reach a problem that is solvable in [math] rounds. Then is a lower bound for the time complexity of our initial problem . Of course, we can also stop before we reach a [math]-round solvable problem, and the respective index of the problem is also a lower bound.
If, informally speaking, we relax the problems obtained after each speedup step too much, the lower bound we obtain in the end might be asymptotically worse than the correct (tight) bound, or no improvement on existing bounds at all, so finding the right relaxation is a challenging problem. Moreover, in order to avoid having to find “good” relaxations for many very different problems, it is desirable to find relaxed problems that are similar (perhaps with different parameters of some kind) to previous problems in the problem sequence. The above outline captures exactly what we do in our lower bound for weak -coloring.
We remark that a dual version of the relaxation technique exists, where we obtain upper bounds on high-girth graphs by making problems harder instead of relaxing them. We will see a concrete example for this dual technique when we consider color reduction on rings as a special case of our speedup in Section 4.5.
Description Simplification
As the second tool in our toolbox for managing the increasing description complexity, we provide a “maximality constraint” that can be applied twice per speedup step in order to decrease the set of allowed outputs and thereby simplify the problem. Despite its simplicity, this technique can significantly reduce the description complexity of a problem, as we will see in Section 4.6. As we show in Theorem 2, this simplification comes at no cost, keeping the complexity of the problem under consideration as it is.
2.2 How to Lift Bounds to the Model.
A requirement for our speedup result is that the class of input graphs satisfies a property that we call -independence. Informally, a graph class is -independent if the following holds: If any node111If we want to be a bit more precise, the same also has to hold for any edge, where for simplicity, we also consider edges as computational entities. that has gathered all information in its radius- neighborhood extends its view by one hop along some edge, then the new information obtains does not affect what information might see if it extends its view by one hop along any other edge. In particular, if the nodes are equipped with globally unique identifiers, then -independence does not hold: if a node sees some ID in the extended view along some edge, it knows that this ID cannot be in any of the extended views along the other edges, due to our girth condition. While almost every other kind of symmetry breaking information commonly used, such as node colorings, edge colorings, edge orientations, or combinations thereof, satisfy -independence, extending bounds obtained by our speedup technique to the setting with unique IDs, i.e., the model, requires additional techniques. Note that upper bounds obtained by our technique immediately apply to the model (as basically every other symmetry breaking information can be inferred from unique IDs), hence we will focus on lower bounds in the following.
Method I: Randomization
As demonstrated in [2, 9], by explicitly incorporating error probabilities into the speedup steps, lower bounds in our setting can be lifted to the randomized model (which essentially guarantees -independence since no unique IDs are required). The obtained bounds are weaker than the original bounds from our setting, which is to be expected considering that allowing randomization can only lower the complexity of a problem. In a second step, the randomized bounds can then be lifted to the (deterministic) model, by exploiting gaps in the complexity landscape of so-called LCL problems [12], or by explicitly showing that the existence of a deterministic algorithm of some complexity would imply the existence of a randomized algorithm that violates the randomized lower bound [2]. The available evidence [2, 12] suggests that this detour via the randomized complexity does not weaken the deterministic bound: in both cases, the bound in our setting is identical to the bound in the model. This is not too surprising since the uniqueness of IDs might simply not be enough to change the complexity of a problem (as compared to, say, a setting with non-unique IDs); however, a proof for this is not known and would be a valuable step forward.
Method II: Order-Invariant Algorithms
A second technique to lift bounds to the model comes into play when we are interested in time complexities as a function of the maximum node degree of the input graph. By a result of Naor and Stockmeyer [24], if there is a constant-time algorithm solving a locally checkable problem in the model, then there is also an order-invariant algorithm with the same runtime, where order-invariant indicates that any node only uses the relative IDs, i.e., the order of the IDs it sees, but not the actual ID values, in order to determine its output. Hence, if there is an algorithm solving a locally checkable problem with runtime independent of , then we can essentially restrict attention to order-invariant algorithms. We provide an extension of our speedup result (Theorem 3) that shows that for order-invariant algorithms our speedup holds also in the case of unique IDs, i.e., in the model. We will make use of this extension when we prove our lower bound for weak -coloring (in the model).
3 Preliminaries
Graphs
All graphs we consider throughout the paper will be simple, undirected and connected. We denote the set of nodes of a graph by and the set of edges by , and we set . Furthermore, we write for the degree of a node and denote the maximum node degree of a graph by . A (-)regular graph is a graph where for all . The girth of a graph is the length of the smallest cycle. A matching is a set such that no two distinct edges from share an endpoint.
An important component in designing and proving our speedup is the idea to split the output of a node into parts that belong to incident edges. As the basis for a convenient representation of such a split output, define as the set of all pairs where and is an endpoint of . Finally, for a graph class , we denote the subclass of consisting of the contained graphs with nodes and maximum degree by , for all non-negative integers . Similarly, the subclass of graphs with maximum degree is denoted by .
Input-Labeled Graphs
Commonly, locally checkable problems are defined by allowed configurations of output labels and a specification of the given inputs and the considered graph class. Definitionwise, we will strictly separate between the outputs on one side (which will define what we call a problem) and the inputs222Note that in this work, we focus on the common case of problems where the correctness of the output does not depend on the given inputs. Adapting the speedup results to the case where output correctness depends on the inputs is not hard, but carries a significant technical overhead which would needlessly impair readability. and the graph class on the other side (which will be given by input-labeled graphs). This enables us to give a very general definition of the setting in which our results are applicable; more importantly though, this separation caters to the fact that the sequence of problems we obtain by repeatedly speeding up a given problem is independent of the considered inputs and the considered graph class.
For the definition of input-labeled graphs, we will also use the set defined above, which allows for a convenient way to encode, e.g., edge orientations. Furthermore, we will not restrict attention to graphs with bounded degree or to bounded input label sets; instead we will use the following more complicated, but also more general definition, which allows, e.g., to define graph classes of unbounded degree with an input edge coloring (which requires labels), or unique IDs (which come from a set that is a function of ).
Let be a (possibly infinite) set of input labels and a function such that is a finite subset333Note that, throughout the paper, we use the expression for the power set of set , as opposed to the set of functions from to . of , for all . A -input-labeled graph is a pair , where is a graph and is a function , where and are the number of nodes and the maximum degree of , respectively. For simplicity, we will usually omit the function and simply denote the -input-labeled graph by . We extend the notion of being -input-labeled to graph classes and say that a graph class is -input-labeled if each graph in is -input-labeled. Throughout the paper, all considered graph classes are assumed to be input-labeled, if not stated otherwise. Furthermore, if we want to avoid that nodes have to be able to compute uncomputable functions during the distributed computation, we can require additionally that the function that maps each pair to the graph class (as well as any other function involved in specifying parts of a distributed problem) is computable.
Problems
The speedup results we present apply to all locally checkable problems; however, formally, we will only consider problems where the validity of a global output essentially444Due to our particular way of splitting the output of a node into partial outputs for each incident edge, our formal definition of a problem will contain acceptable configurations for both edges and nodes; a more precise term for edge-checkability would thus be node-and-edge-checkability. can be checked on edges, i.e., there is a set of acceptable output configurations for the two endpoints of an edge, and the global output is correct if and only if the configuration for each edge is acceptable. Restriction to these problems does not lose generality for our purposes: by requiring that each node outputs the computed output labels (and the topology555Note that in general graphs where nodes do not have unique identifiers, the information a node obtains in rounds may not be enough to determine the exact topology of the subgraph induced by all nodes in distance at most ; however, since we will only consider radius- neighborhoods in graphs that have girth at least , the subgraph topology in each radius- neighborhood is a tree which implies that each node can determine the exact topology of its radius- neighborhood.) of its whole radius- neighborhood for some suitably large constant , any locally checkable problem can be transformed into an edge-checkable4 problem with the same asymptotic time complexity. For the definition of (our restricted version of) a problem, we will need the notion of a multiset, which is simply a set in which elements can have multiplicity larger than , but where, as usual, the order of elements does not matter.
Formally, for the scope of this paper, a problem is defined by
a (possibly infinite) set of output labels, 2. 2.
a function such that is a finite subset of , for all , 3. 3.
a function that maps each to a set of -element multisets where both elements are taken from , and 4. 4.
a function that maps each to a set of multisets with at most elements, all taken from .
The specification of ensures that problems that require the set of output labels to depend666Similarly to the case of unique IDs as input labels, one could also allow for problems where the set of output labels depends on . However, we are not aware of commonly studied problems of this kind, which is why we chose the simpler definition that only allows a dependence on . on , such as -coloring, are included in our problem definition. The sets and formalize which output configurations are allowed on an edge (i.e., at and ), resp. at a node (i.e., at , where are the edges incident to ). For instance, the problem of -coloring can be described by setting , , , .
Combining problems and input-labeled graphs, we define a realized problem as a pair , where is a problem and a -input-labeled graph class. For convenience, we may simply use the term “problem” for . We say that an algorithm solves a realized problem (or, equivalently, that solves on ) if, for any and any graph , assigns an output to each pair such that, for each edge , the multiset is contained in , and for each node , the multiset is contained in .
Model
Since we defined inputs to be part of the considered graph class (and hence assigned the duty of providing sufficient symmetry-breaking information to the choice of the graph class), we can use a very weak model of computation. This has the advantage that essentially all problems that are defined in a stronger model, such as the model, can also be formulated in our model. The only requirement that we need is that nodes are able to distinguish between their neighbors (or incident edges), which is why we formally choose the port numbering model [1] as our model of computation.
In the port numbering model, each node of the input graph has many ports which correspond to the edges incident to ; the two endpoints of an edge may have different ports corresponding to the connecting edge. Each node can communicate with its neighbors by sending messages along the connecting edges. Computation proceeds in synchronous rounds where in each round each node first sends arbitrarily large messages to its neighbors and then, upon receiving the messages sent by its neighbors, performs some arbitrarily complex local computation. Each node executes the same algorithm and has to terminate at some point, upon which it outputs its local part of the global solution (e.g., if the task is to find a proper node coloring, each node outputs its own color). The runtime of such a distributed algorithm is the number of rounds until the last node terminates. In the beginning of the computation, each node is aware of the parameters and and sees the input label assigned to for all incident edges , i.e., one input label per port. When terminating, a node assigns an output label from to each . We are interested in the worst-case runtime of a distributed algorithm, i.e., for worst-case input-labeled graphs with worst-case assignments of port numbers to edges.
A distributed algorithm is correct if the output labels assigned to the elements in satisfy the constraints encoded in and at each edge, resp. node. More precisely, we say that a distributed algorithm solves a realized problem if the above is true for each graph (where are defined by ). In this work, we formally only consider deterministic algorithms; the implications for randomized algorithms have been discussed in Section 2.2.
It is well-known that the best a node can do in rounds of communication is to gather the whole input information contained in its radius- neighborhood (as well as the topology5), and then decide on its output using only the collected information. Hence, a -round algorithm can be equivalently described as a function that maps each possible radius- neighborhood of a node to a tuple of output labels (one for each port of , i.e., each ). Note that in this description the degree of is only fixed when a radius- neighborhood has been chosen.
Neighborhoods
Let be a -input-labeled graph. For any node and any non-negative integer , we define the radius- neighborhood of (in ) as the collection of information that can obtain in rounds, i.e., the topology of the subgraph of induced by the set of all nodes in distance at most from , together with the input information each of these nodes possesses at the very beginning.5 Similarly, for any edge , we define the radius- neighborhood of (in ) as the collection of information that both and can obtain in rounds, i.e., the topology of the subgraph of induced by the set of all nodes in distance at most from both and , together with the respective input information. For convenience, we may occasionally forget about assigned labels or edges and consider or as unlabeled graphs or sets of nodes. We write and if we want to specify the underlying graph . We say that the radius- neighborhood of a node in graph and the radius- neighborhood of a node in graph are isomorphic and write if the collection of information can obtain in rounds in is the same as the collection of information can obtain in rounds in . We define isomorphisms for radius- edge neighborhoods analogously.
For simplicity, we will abuse notation, and use set operations to describe further collections of information, in the canonical way. For instance, we write . In particular, we are interested in “extensions” of such information collections, e.g., the information a node can obtain in rounds that a neighbor cannot obtain in rounds, or the information two neighboring nodes and can both obtain in rounds that cannot obtain in rounds. Hence, for any node , any edge , and any positive integer , we define and . Again, we write and to specify the underlying graph.
Consider some radius- neighborhood of an edge , and let be a graph class containing . We say that has an extension in (along ) if there is a graph such that , where is some node of . For convenience, we will identify and (and other nodes and edges in isomorphic neighborhoods), allowing us, e.g., to specify the output label of in graph . Analogously, we say that has an extension in (along ) if there is a graph such that , where is some edge of , and we will identify nodes (and edges) similarly as above.
-Independence
As mentioned in Section 2.2, our speedup results require a property called -independence. This property essentially only exists in graph classes with sufficiently high girth which is the reason (together with the difficulty of determining neighborhood topologies if the girth is too small) why we restrict attention to these classes of graphs. Roughly speaking, -independence is satisfied if for any fixed radius- neighborhood of a node , the set of extensions along one incident edge is independent of the sets of extensions along the other incident edges (i.e., fixing an extension along one edge does not influence which extensions are possible along the other edges), and if a similar statment holds for edge neighborhoods. See Figure 1 for an illustration.
Formally, we define -independence as follows.
Let be a -input-labeled graph class consisting of graphs with nodes and maximum degree and let be a positive integer. We say that is -independent if for any graph , any node , and any edge , the following two properties are satisfied:
Set \mathcal{EXT}^{t}_{v}(e):=\{\textrm{Ext}^{t}_{w,H}(e^{\prime})\mid H\in\mathcal{G}_{n,\Delta},e^{\prime}\in E(H),\textrm{ and }H,e^{\prime}\textrm{ satisfy }N^{t}_{H}(e^{\prime})\cong N^{t}_{G}(e)\textrm{ where}\\ \textrm{vw }\textrm{are corresponding nodes under this isomorphism}\}, and define analogously. Then, for each element and each element , there is a graph and an edge such that , , and , where, under the given isomorphism, corresponds to , and to . 2. 2.
For each edge incident to , set \mathcal{EXT}^{t}_{e^{\prime}}(v):=\{\textrm{Ext}^{t}_{e^{\prime\prime},H}(w)\mid H\in\mathcal{G}_{n,\Delta},w\in V(H),\textrm{ and }H,w\textrm{ satisfy }N^{t-1}_{H}(w)\cong N^{t-1}_{G}(v)\textrm{ where e^{\prime}e^{\prime\prime} are corresponding edges under this} . Then, for each indexed family with , there is a graph and a node such that and, for all incident to v, , where, under the given isomorphism, corresponds to .
While this exact definition of -independence is cumbersome, the intuition behind it makes it straightforward to check that -independence is satisfied for the usual symmetry breaking inputs that do not include unique IDs, such as node colorings, edge colorings777We assume all symmetry breaking inputs to be given in the natural way, i.e., input for a node to be encoded at all , and input for an edge to be encoded at all . In particular, in a [math]-round algorithm each node is aware of the colors and orientations of all incident edges (otherwise -independence might not be satisfied)., edge orientations, or combinations thereof (in common graph classes with graphs of girth at least ).
If is -independent for each , then we say that is -independent. If is a graph class such that, for each non-negative , is -independent (resp. -independent), then we say that is -independent (resp. -independent). Note that if is empty, it is trivially -independent for any positive .
4 The Speedup Theorem
In this section, we will present our speedup results and apply them to several problems. We start by defining our automatic speedup that produces a sequence of problems with decreasing runtimes, and subsequently prove our main theorem (Section 4.1). In Section 4.2, we introduce an important simplification technique that reduces the complexity of the descriptions of the problems in our sequence (by transforming the problems) and show that this technique is compatible with the main theorem. Then we prove that the main theorem can be extended to settings with unique IDs if we restrict attention to order-invariant algorithms (Section 4.3). In Sections 4.4 and 4.5, we will see how to obtain two known results by applying our speedup, confirming the viability and generality of the speedup technique. Finally, in Section 4.6, we will examine the effect of our speedup on weak -coloring, giving some intuition for the generalization of weak -coloring that is essential for our lower bound proof in Section 5.
4.1 The Theorem
In the following, we describe how to transform a given problem into a problem that can be solved one round faster. Our speedup consists of two steps: First we transform into a problem that, in some sense, can be solved half a round faster, then we transform into . Recall that a -round distributed algorithm (for ) is nothing else than a function that maps each possible radius- neighborhood to a tuple of outputs for . By saying that can be solved half a round faster, we mean that there is a distributed algorithm for where each node looks only at a neighborhood that is smaller than its radius- neighborhood , but larger than its radius- neighborhood . More precisely for deciding on the output at , node looks only at the radius- neighborhood of . An alternative way to look at this is to consider the edges of the input graph as the computing entities in an algorithm for ; each edge then decides on the outputs at and . This highlights the inherent duality of the two steps in our speedup: both steps are essentially the same with the difference that the role of nodes and edges is swapped.
Deriving Problems
Let be a problem, and let be the set of output labels and the functions used to define . We define the problem by specifying the set of output labels and the three required functions , , and as follows.
We set and , and we define as the set of all multisets , where , with the following property:
For any , , the multiset is contained in .
We define as the set of all multisets , where and , with the following property:
There exist elements such that the multiset is contained in .
Since is finite for all , also is finite. Hence, indeed satisfies the definition of a problem. Similarly to how we derived from , we will now derive from . More precisely, is defined as follows.
We set and , and we define as the set of all multisets , where , with the following property:
There exist elements such that the multiset is contained in .
We define as the set of all multisets , where and , with the following property:
For any , the multiset is contained in .
With the same reasoning as for , we see that satisfies the definition of a problem. For an illustration of the definitions, we refer to the concrete examples in Sections 4.4–4.6. In the same way as we derived and from , we can derive problems and from . In general, set , and for any positive integer , define recursively and .
The intuition behind the definition of (and the proof of Theorem 1) is that the restrictions for and given in Properties 1 and 2 are just weak enough that an algorithm that looks only at radius- edge neighborhoods can infer correct outputs by simulating an algorithm for on the radius- neighborhoods obtained by extending the radius- edge neighborhoods (with all possible extensions). The tightness of the two properties makes it possible that, conversely, we can also infer a (half-round slower) algorithm for from an algorithm for . A dual version of the above arguments holds for the relation between and . In the following, we show how to derive the algorithms for and mentioned above from an algorithm for in a black box manner (the converse direction will be part of the proof of Theorem 1).
Algorithm Speedup
Let be an arbitrary algorithm that solves some problem on some graph class . Let be the worst-case runtime of on the graph class . We define an algorithm for as follows.
Consider an arbitrary graph , and let denote the number of nodes of , and the maximum degree. In the following, we describe how an arbitrary node executing decides on the output label it will assign to , where is some arbitrary edge incident to . Node starts executing Algorithm by collecting the radius- neighborhood of edge . Note that has access to and and therefore can determine . Then, for each output label , node determines whether has an extension in such that would assign output to according to . Let be the set of all for which such an extension exists. In other words, simulates on each radius- node neighborhood occurring in that can be obtained by extending in direction of (seen from ), and collects in all output labels that outputs in each such neighborhood at the node-edge pair corresponding to . Finally, node outputs at . This concludes the description of . We will argue about the correctness and the runtime of (and ) in the proof of Theorem 1.
Analogously, we define an algorithm as follows, by deriving it from . Again, we focus on the output assigns to according to , where . Similarly to before, starts executing Algorithm by collecting the radius- neighborhood of . Then, for each output label , node determines whether has an extension in such that would assign output to according to . Finally, collects all for which such an extension exists in a set , and outputs at . Now we are set to prove our main speedup result.
Theorem 1**.**
Consider some arbitrary problem and the derived problem . Let be a -input-labeled graph class consisting of -node graphs with maximum degree . Assume that is -independent for some positive integer and contains only graphs of girth at least . Then, the following two statements are equivalent:
- (1)
There is an algorithm solving in time .
- (2)
There is an algorithm solving in time .
Proof.
We first show that (1) implies (2). Let be an algorithm solving in time . Consider the algorithms and derived from in the manner described above. We argue that solves and that for deciding on the output assigned to some node-edge pair in the execution of , node only looks at . Furthermore, we argue that solves in time . The statement that only looks at directly follows from the definition of ; analogously, the runtime of follows from the description of . Hence, what is left to show is that the two derived algorithms actually solve the two derived realized problems. We start with and .
Let be an arbitrary graph in , and let be an arbitrary node, the edges incident to , and the other endpoint of . Furthermore, let denote the output assigns to , and let the output for other node-edge pairs be denoted analogously. We have to show that A) the multiset is contained in , and B) the multiset is contained in .
For A), observe that, by the definition of , for any two output labels and , has both an extension in such that would output at and an extension in such that would output at . Due to the -independence of , there is a graph such that the radius- neighborhood of some node is isomorphic to , which implies that, in , would output at and at . Since solves , it follows that the multiset is contained in . We conclude that for any , the multiset is contained in , which shows that the multiset is contained in , by Property 1 in the definition of .
For B), consider , and let be the outputs would assign to , respectively, when executing . Observe that, for any , has an extension in such that would output at , namely . Hence, by the definition of , we have . Since the multiset is contained in (due to solving ), it follows that the multiset is contained in , by Property 2 in the definition of . This concludes the proof showing that solves .
The proof for showing that solves is analogous. Note that this proof does not rely in any way on the fact that and are derived from another problem, resp. algorithm. The proof works just as well if we replace by any other problem and by any other algorithm that solves the problem on and has the property that each node decides on the output for by only looking at .
Now, we show that (2) implies (1). Let be an algorithm solving in time . We derive an algorithm as follows. For each graph and each edge , let and be the outputs assigns to and , respectively, when executed (by ) on . Since solves , there exist elements such that the multiset is contained in , by Property 3 in the definition of . We define algorithm to output at and at , when executed on ; thereby trivially satisfies one of the two conditions for solving (the one concerning ). By Property 4 in the definition of (and the fact that solves on ), algorithm also satisfies the other condition, concerning . Hence solves . Observe that due to the design of , each node can determine the output at according to by looking only at .
Now, using an analogous argumentation, we can define an algorithm that solves , where each node can determine the output at by looking only at . Hence, there is an algorithm solving in time . ∎
Theorem 1 provides an explicit version of our speedup guarantees for graph classes . We formulate the implications for general graph classes in the following corollary of Theorem 1.
Corollary 1**.**
Consider some arbitrary problem , and let be a -input-labeled graph class and some function. If, for all , it holds that is -independent and contains only graphs of girth at least , then the following two statements are equivalent:
- (1)
There is an algorithm solving in time .
- (2)
There is an algorithm solving in time .
Note that Theorem 1 and Corollary 1 do not contradict the existence of known gaps [12, 13, 24] in the time complexity landscape of LCL problems since the problem obtained after applying our speedup a non-constant number of times is not necessarily an LCL problem anymore (but is still locally checkable).
4.2 Simplification via Maximality
In this section, we will present a technique that simplifies the derived problems and without affecting the correctness of Theorem 1. Consider the (common) case that the input labeling of a considered graph class contains some symmetry breaking information for the two endpoints of each edge, i.e., any two nodes , connected by an edge can select the same of the two endpoints. We will consider the simplest case that the desired symmetry breaking can be achieved by both nodes by just looking at their [math]-round input information, i.e., the respective input graph contains edge orientations (where the orientation of an edge is encoded in both input labels assigned to and ). Note that if, instead, the nodes have to look up to a distance of (or, more generally, some constant ), as, e.g., in the case of an input node coloring or input IDs, then the simplification technique we present still works for all cases except for the speedup of a -round (resp. any -round) algorithm to a [math]-round (resp. one-round-faster) algorithm.
The idea behind the simplification technique is that we can reduce the number of output labels the derived algorithm for can use, by making the set smaller; as we will show this can be done in a way that ensures that the algorithm (or, more precisely, its adapted version) can still solve (the new version of) in the same runtime as before. More specifically, observing that the definition of contains an existential and a universal statement, we can replace each multiset in by some with if still satisfies the universal Property 1 in the description of ; the correctness of the existential Property 2 is not affected by this and shrinks. The symmetry breaking on edges mentioned above is a necessary ingredient for performing this process in a consistent manner (in the dual case for , we need analogously a symmetry breaking for nodes; however, this is already provided by the port numbers). As we will see in Sections 4.4–4.6, this approach can lead to a significant decrease in usable outputs. For a simplified problem, we may compress the problem description by assuming that the respective and contain only those outputs that can actually occur in a correct solution (i.e., that occur in both some multiset contained in and some multiset contained in ).
Simplified Problems
We obtain the simplified version of by replacing Property 1 in the description of by the following extension:
For any , , the multiset is contained in and the sets and are maximal with this property, i.e., for any , the multiset does not satisfy Property 1, and for any , the multiset does not satisfy Property 1.
Analogously, we obtain the simplified version of by replacing Property 4 by the following extension:
For any , the multiset is contained in and the sets are maximal with this property, i.e., for any , it holds that, for any , the multiset does not satisfy Property 4.
However, just performing the latter replacement as it is will result in a (simplified) problem that is derived from , not from . Hence, we define by not only performing the above replacement, but also replacing by in Property 3, where is the analogue to in (and actually the only part in which the definitions of and differ). Now, similarly to above we can define simplified problems recursively for any positive by setting and . Note that for simplicity, we write, e.g., instead of when talking about the simplified problem if it is clear which problem is considered.
In the following theorem, we show that Theorem 1 still holds if we replace a derived problem by its simplified version. Naturally, this implies that also an analogous version of Corollary 1 holds for our simplified problems. Moreover, from the proof of Theorem 2, it follows that we can arbitrarily decide after which performed problem derivation we apply the simplification technique and after which we do not, without affecting the runtime of the derived problem.
Theorem 2**.**
If we augment the conditions of Theorem 1 by the requirement that the inputs for the graphs from contain an orientation for each edge, then the following two statements are equivalent:
- (1)
There is an algorithm solving in time .
- (2)
There is an algorithm solving in time .
Proof.
Recall that, as observed in the proof of Theorem 1, the speedup of one round is actually obtained by two independent speedups, going from to and from to . In particular, the fact that is a derived problem is not relevant for the second speedup. Therefore (and due to the two speedups being analogous), it is enough to argue that if there is an algorithm solving by looking only at radius- edge neighborhoods, then there is also an algorithm solving by looking only at radius- edge neighborhoods, and vice versa. Going from to is easy: since is a more restrictive version of , any algorithm solving also solves , hence setting is sufficient.
For the other direction, given an algorithm as described above, we transform it into an algorithm as described above, as follows. Consider an arbitrary edge in some graph . From the orientation of , nodes and infer an ordering of the set in some fixed way (so, both and infer the same ordering, in [math] rounds). Then, and compute the outputs and of at and , respectively. Since these two outputs are uniquely defined by (and ), both and can compute both outputs by only looking at . Now both nodes do the following (internally, in [math] additional rounds), where we assume w.l.o.g. that is first in the computed ordering. First, in a fixed way888More formally, this is to say that and apply a deterministic algorithm to infer which maximal superset of is picked (based on knowing and ), so that both nodes pick the same superset., they pick a set , that is maximal among all such sets with the property that the multiset satisfies Property 1 in the definition of . Then, analogously, they pick a set , that is maximal among all such sets with the property that the multiset satisfies Property 1. Finally, outputs at , and outputs at . Now it is straightforward to check that algorithm , defined in this way, actually solves , and each node can determine the output at by only looking at . ∎
4.3 Order-Invariant Algorithms
As mentioned in Section 2.2, if we are interested in runtime bounds as a function of , then restricting our attention to order-invariant algorithms allows our speedup to be applied to models where nodes are equipped with unique IDs, such as the model. The following theorem extends Theorems 1 and 2 (and, consequently, the respective corollaries) to the case of order-invariant algorithms and is an important building block in our lower bound proof for weak -coloring. Recall that an order-invariant algorithm [24] is defined as an algorithm where each node’s output only depends on the relative order of the IDs it sees (and possibly other input information), but not their absolute values (i.e., if the input ID assignment in the view of a node is changed to another assignment with the same ordering of the nodes according to the IDs, then the output of must remain the same).
Theorem 3**.**
Consider some arbitrary problem and the derived problems and . Let be a -input-labeled graph class consisting of -node graphs with maximum degree . Assume that is -independent for some positive integer and contains only graphs of girth at least . Let be a finite set of identifiers satisfying . Let be the class consisting of all999If , then is empty. If one wants to avoid empty graph classes, one can simply require , here and in Lemma 4. (input-labeled) graphs obtained by taking a graph from and assigning unique identifiers from to the nodes of the graph. Then, the following statements are equivalent:
- (1)
There is an order-invariant algorithm solving in time .
- (2)
There is an order-invariant algorithm solving in time .
- (3)
There is an order-invariant algorithm solving in time .
Proof.
We first show that (1) implies (2). Assume there is some order-invariant algorithm solving in time , and for convenience, denote this algorithm by . Our theorem would follow from Theorem 1 if was -independent; unfortunately, we only know that is -independent. We circumvent this issue by adapting our definition of the two algorithms and . Recall the definition of . We change it only very slightly, as follows, again focusing on the output a node will assign to , where . Directly after collecting the radius- neighborhood of edge , node internally changes the IDs assigned to the nodes in . More precisely, in a fixed way, depending only on the relative ID values (i.e., in an order-invariant way), each ID is changed (to some ID from ) so that the order of the nodes in according to their IDs stays the same and any two of these IDs differ by at least . By our constraint on the size of , this is possible as the number of nodes in is at most . After that, node proceeds as given in the definition of . Clearly, this new version of is order-invariant and each node only looks at ; in the following we argue that it also solves .
For seeing that the argumentation in proof part B) of Theorem 1 still works, it is sufficient to observe that the set our modified outputs at is a superset of the set the original would output at . This is the case due to the order-invariance of and the fact that is at least as large as the number of nodes in (so, for each actual ID assignment to , there is an ID assignment with the same order on the nodes in that respects ’s internal ID choices for the nodes in ). Looking at the individual steps in proof part A) of Theorem 1, we see that, apart from changing by updating the IDs as done by internally, the only argument we have to change due to our new definition of is the argument where the -independence is used. However, instead of using the (non-existent) -independence of , we can simply use the -independence of , combined with the order-invariance of and the fact that is at least as large as the number of nodes in and together (where is the other endpoint of ). An analogous adaptation of (derived from the adapted ) combined with an analogous proof adaptation yields our first implication.
For the implication from (2) to (1), we can use an analogous argumentation to the one given in the proof of Theorem 1. The only place where we have to be careful is when we choose elements (such that ) during the definition of (and in the analogous place during the definition of ). Here, in order to ensure the order-invariance of , we have to choose and in a way that ensures that the same are chosen for the same relative ID assignments in . This is possible due to the order-invariance of .
For the implication from (2) to (3), observe that the same algorithm transformation (“from to ”) as performed in the proof of Theorem 2 (where, in the current case, we infer the required edge orientations from the (relative) IDs) ensures that the obtained algorithm solves in time ; moreover, as the outputs given by the transformed algorithm depend only on the outputs given by the initial algorithm and the inferred edge orientations, the order-invariance of the transformed algorithm follows from the order-invariance of the initial algorithm. Since the above statements hold analogously for the case of (instead of ), the desired implication follows. The implication from (3) to (2) is shown as in the proof of Theorem 2. ∎
4.4 Warm-up I: Sinkless Coloring and Sinkless Orientation
In this section, we will apply our speedup technique to the problem of sinkless orientation. As we will see, our technique gives a deterministic version of the randomized speedup for sinkless orientation presented in [9], showing that our speedup in some sense encompasses known techniques for concrete problems as special cases. In particular, the duality between sinkless orientation and sinkless coloring which is essential for the lower bound proof in [9] is automatically produced by our speedup.
The problem of sinkless coloring (as it is given in [9]) is defined as follows: Given a -regular edge -colored graph, each node has to output an orientation for each incident edge such that the two endpoints of an edge agree on the orientation of the edge and each node has at least one outgoing edge.
While our speedup does not require the edge coloring, we will add edge orientations to the input in order to be able to apply our simplification technique. Note that adding edge orientations or some other symmetry breaking information as input is required in the deterministic case since the random bits used to break symmetry in the randomized case are not available and the edge coloring alone is not sufficient for breaking symmetry.
While it is possible to set our initial problem to be sinkless orientation (in which case the derived problem is also sinkless orientation, roughly speaking because the output for sinkless orientation describes an edge property), it is more natural to choose to be the dual problem called sinkless coloring (since for this problem the output describes a node property). Sinkless coloring is defined as follows: Given a -regular edge -colored graph, each node has to output one of the edge colors such that at least one of the two endpoints of any edge outputs a different color than the color of the edge.
Due to our choice of having one output per element of , not per node, there is a simple (and canonical) way to encode this problem in our setting, even without an input edge coloring. Define as follows (where an output of at indicates that chooses the color of ). Set
[TABLE]
where the only element of is a multiset with elements. Applying our speedup transformation, we obtain for the derived problem
[TABLE]
Applying our simplification technique, we see that the only element of that is maximal is . Hence, we obtain for
[TABLE]
Writing [math] for and for , we can describe equivalently by
[TABLE]
This specification describes exactly the problem of sinkless orientation, where an output of at indicates that orients away from itself, and an output of [math] at that orients towards itself. Now, deriving leads to the problem specified by
[TABLE]
Observing that the only maximal element of is , we see that is given by
[TABLE]
Writing [math] for and for , we see that we are back at sinkless coloring, i.e., .
Applying Theorem 2 iteratively, we see that the existence of a -independent graph class of girth at least implies that any -round algorithm for sinkless coloring can be transformed into a [math]-round algorithm for sinkless coloring. Since the latter does not exist, whereas, for any , such high-girth -independent graph classes exist for some (as can be shown using [8, Chapter III, Theorem ], or, in the case of an input edge-coloring, [9, Lemma 9]), we obtain an lower bound for sinkless coloring, and therefore also for sinkless orientation, for each .
4.5 Warm-up II: Color Reduction
In this section, we will give an example of how to use our speedup to obtain a (well-known) upper bound. The problem we are interested in is the problem of properly -coloring a ring in the model. Cole and Vishkin [14] and Goldberg et al. [18] gave an upper bound of for this problem, which was subsequently proved to be tight by Linial [22]. The upper bound is based on the idea of interpreting the given unique IDs (which are bitstrings of length ) as colors and then reduce the number of colors in each round of the distributed computation exponentially. Other methods of achieving such an exponential (or doubly exponential) color reduction were devised later (see, e.g., [21, 23, 26]).
We show that our speedup provides a doubly exponential color reduction (on rings). In order to achieve this result, we make use of the technique for upper bounds outlined in Section 2.1: after deriving problem from , we change to a problem that is provably at least as hard as and has a much simpler description. As we will see in the following, if is the problem of -coloring, then the resulting problem is -coloring for some satisfying . We will focus on the case of before arguing about general .
Formally, we can describe -coloring as follows101010Note that we use in the description that the input graphs we are interested in are rings. Since our notion of a problem, however, is independent of the actually considered input graphs, we still have to define the three functions for any . The problem descibed by our specification is also defined on graph classes that do not only contain rings, but it might be the case that there is no algorithm that solves the respective realized problem..
[TABLE]
For the derived and simplified problem , we obtain
[TABLE]
After the following derivation, we do not apply our simplification technique, hence obtaining problem characterized by
[TABLE]
Now we transform into a problem that is at least as hard as , by removing outputs from while also (potentially) removing elements from and . More specifically, our new output set contains exactly the sets with the following properties:
- •
For each , we have .
- •
For each set with , exactly one of and the complement of (in ) is an element of .
We argue that this particular choice for satisfies two desirable properties, namely, that for any with , and for any . For the first property, observe that, for any with , there must be some set with such that is contained in exactly one of and . It follows that the other of the two contains the complement of , which in turn implies that there are elements which are complementary (in ). We conclude that , which implies .
For the second property, observe that, for any and any , we know that and are not complementary. Since both and are sets of cardinality , it follows that , which in turn implies that . Hence, .
Let be obtained from by removing all contained multisets with . Similarly, let be obtained from by removing all contained multisets with . Let denote the number of elements of . Due to the two properties of we just proved and by renaming the elements of , we can now describe as follows.
[TABLE]
We conclude that is the -coloring problem.
Counting the number of elements of , we obtain . Analogously to the special case of , we can derive the problem for general even (where is defined as -coloring). A close look at the general version of reveals that we then obtain , which implies if . Applying our speedup theorems, we conclude that the existence of a -coloring algorithm implies the existence of a -coloring algorithm that requires at most one additional round. Since is a monotone function, this implies an (asymptotically) doubly exponential speedup per round, which in turn implies an upper bound for -coloring a ring. Note that, since our speedup theorems require -independence, we will not assume that the given IDs are globally unique, but only that any two adjacent nodes have different IDs (which constitutes a coloring). However, since this assumption is weaker than assuming globally unique IDs and we are proving an upper bound, the bound directly applies to the model.
4.6 Weak -coloring
The goal of this section is to give an intuition for the generalization of weak -coloring we use to prove our lower bound in Section 5. Recall that, in order to apply our speedup technique iteratively without having to look at each problem in the produced problem sequence individually, we would like to start with a problem for which applying our speedup technique results in some problem that can be relaxed to a problem that is very similar to . Ideally, the relaxed problem is the same as except for that some parameters in the problem description are different. Also, our initial problem should be similar to the problem we want to prove a lower bound for, i.e., weak -coloring; the simplest case would be that is a relaxation of weak -coloring so that lower bounds for immediately apply to weak -coloring.
In order to obtain some intuition for the effects of our speedup on problems that are similar to weak -coloring, the obvious approach is to apply the speedup to weak -coloring itself, which we do in the following. Recall that in the weak -coloring (resp. weak -coloring) problem, each node has to output a color from, say, the set (resp. ), such that there is at least one neighbor with a different color (or the node has no neighbors). Note that this formulation of the problem is not even edge-checkable; before we can apply our speedup we first have to bring weak -coloring in a form that adheres to our definition of a problem.
Problem
Consider the following problem : Each node has to output a color from and a pointer to an adjacent node. The output is correct if each node points to a node that has a different color than . Formally, we can describe by setting
[TABLE]
Note that, in the formal description, we use the fact that we are only interested in -regular graphs since we will prove our lower bound in Section 5 already for this class of graphs.
Clearly, satisfies our definition of a problem; furthermore, we argue that if there is an algorithm solving weak -coloring, then it can be transformed into an algorithm for that requires only one additional round. This is easy to see: each node can learn the color of each adjacent node (according to the algorithm for weak -coloring) in this additional round and then point to a node of different color. We call the pointer version of weak -coloring; however, in this section we will simply refer to as weak -coloring (as it is essentially the same problem).
Problem
In the following, we derive problem from . According to the process of deriving , described in Sections 4.1 and 4.2, we obtain . Furthermore, consists of all multisets such that A) each is a subset of , and B) there is some such that and for all . The most interesting part of the definition of is as Property 5 comes into play here. Observe that for each multiset contained in , it holds that if , then both and are not contained in . Taking the maximality constraint of Property 5 into account, we obtain that contains exactly the multisets where
[TABLE]
Note that the definition of allows for one of and being empty; however, our function ensures that an empty set cannot be chosen as output, thereby ruling out the first listed row as a possibility for outputs on some edge. Hence, as we can see from the description of , there are only outputs that can be used by any correct algorithm for . This indicates that there may be a simpler, or more accessible, (equivalent) description of , and indeed there (arguably) is, as given in the following.
An Equivalent Description
Set , i.e., the set of allowed outputs is the set of all trit111111A trit is the ternary analogue of a bit. sequences of length excluding the two sequences and . For any trit sequence of length , any , and any , we say that has an at position if . Define to consist of all multisets of cardinality of trit sequences from such that there is an index such that (at least) one of the trit sequences in the multiset has a at position and none of the trit sequences has a [math] at position . Furthermore, define to be the set of multisets of trit sequences from such that the tritwise addition of and yields the trit sequence . For instance, the multiset of cardinality is an element of (pick ), and the multiset is an element of .
Why is this an equivalent description for ? Consider mapping the set of the “usable” outputs in the initial description to our new output set . More specifically, map each output to the trit sequence that satisfies that is equal to the number of elements from contained in . For instance, is mapped to the trit sequence , whereas is mapped to . Clearly, this mapping is a bijection. Note that if is mapped to a trit sequence satisfying for some , then is contained in , whereas is not. Using this observation, it is straightforward to check that the two functions in the two descriptions are equivalent. In order to see that the same holds for , it is sufficient to observe that in the initial description of , for each multiset contained in , the set contains exactly two elements where the first entry is , and exactly two elements, where the first entry is . Hence, our two definitions describe the same problem. We will use the second, arguably simpler description of for the next step on our agenda, i.e., for the task of deriving from .
Problem
According to the derivation process for , we obtain . Furthermore, consists of all multisets such that A) , and B) there exist elements such that the tritwise sum of and is . Finally, consists of all multisets of sets such that A) for any , there is an index such that (at least) one has a at position and none of the has a [math] at position , and B) the multiset is maximal with Property A), i.e., if we add a new element from to some arbitrary in the multiset, then the multiset does not satisfy Property A) anymore.
While the description of provides a good intuition for which -element multisets are actually contained in , the maximality condition for makes it harder to see the same for . In fact, is an excellent example for the power of the simplification technique introduced in Section 4.2: with some work, it is possible to show that actually contains only elements (or fewer if is very small)! As the goal of this section is merely to provide intuition for our lower bound approach in Section 5, we will only highlight the important observations that can be made by examining those elements.
Problem
As outlined in the beginning of this section, we would like to relax to a problem that is similar to weak coloring and has a simpler description. A very natural idea is to try to relax to weak -coloring as follows. Map each of the elements of to a different color and then show that any algorithm for also solves weak -coloring in the following way: Each node executes to determine the outputs at all , then checks which element of the outputs constitute together, and finally outputs the color corresponding to the element of at each . On top of that, each node (somehow) infers from its output according to an adjacent node it can safely point to (i.e. an adjacent node that must have a different color).
While for most of the elements in this approach can be made to work121212In fact, using several speedup-related tricks, it is possible to relax to to weak -coloring; however, these tricks seem to apply only for our very specific case of being weak -coloring and do not generalize to weak -coloring or similar problems., there is one element in that some node and all its neighbors might “output” (if we gather all partial outputs of the respective node according to algorithm ). In other words, the envisioned solution of for weak -coloring can make and all its neighbors output the same color, which makes it impossible for to output a correct pointer. However, this particular element of (recall that such an element is a multiset of cardinality ) has a very special form: we can write it as , where and are the only two elements of that involve or (i.e., if the outputs of two adjacent nodes according to both correspond to and the output at is or , then the output at must be , where ).
This leads to the idea of modifying our weak -coloring problem slightly: Instead of having to output one pointer that has to point to a differently colored node, a node can also choose to output two pointers and one “inverse pointer” (at three different ), where a (non-inverse) pointer has to point to a differently colored node or to an inverse pointer. For this modified problem, any node has a simple way to infer a correct output if its output according to corresponds to : node simply outputs a pointer at the two where outputs or , and an inverse pointer at the one where outputs .
As we will see (in Lemma 2), this particular property of is not an isolated case but characteristic for problems that are derived from the problems we generalize weak -coloring to. More precisely, for any such and any in the corresponding , we can essentially write as such that 1) , and 2) if two nodes connected by an edge both output , and outputs some element from at , then has to output some element from at (by the definition of the corresponding ). This insight suggests the generalization of weak -coloring that we present in Section 5.
5 A Tight Lower Bound for Odd-Degree Weak -Coloring
This section is devoted to proving a lower bound for weak -coloring on odd-degree graphs that matches the upper bound presented by Naor and Stockmeyer [24]. In Section 5.1, we introduce the problem of superweak -coloring, which is a generalization of weak -coloring, and prove a speedup lemma for this new problem. More precisely, we show that we can save one round in the runtime if we increase the parameter in the problem definition exponentially (a constant number of times). Subsequently, in Section 5.2, we will use the speedup lemma to prove our lower bound.
5.1 A Speedup Lemma for Superweak -Coloring
We start this section by defining superweak -coloring and applying our speedup to it, resulting in a problem . By showing (Lemma 3) that, for some sufficiently large , superweak -coloring is a relaxed version of (i.e., superweak -coloring can be solved at least as fast as ), we can infer our speedup lemma (Lemma 4).
Our main technical ingredient to prove Lemma 3 is a statement (Lemma 2) that, roughly speaking, ensures131313Note that this is essentially a reformulation of the insight obtained at the very end of Section 4.6. that for each output of a node executing (i.e., for each collection of the partial outputs at the for a fixed node ), we can select two disjoint sets and of edges incident to such that and the following property is satisfied: If two adjacent nodes give the same output and the connecting edge is contained in , then must also be contained in . The edges in , resp. , will be exactly those edges where outputs the demanding, resp. accepting, pointers specified in the definition of superweak -coloring.
In order to prove Lemmas 2 and 3, we require the existence of a structural property of the elements in the set (defining ), which is provided by Lemma 1. In particular, this property is responsible for bounding the increase from to .
Superweak -Coloring
Let be some integer. We define the problem of superweak -coloring as follows (using the insights obtained in Section 4.6). Each node has to output a color from and a number of pointers to adjacent nodes. There are two kinds of pointers, demanding pointers and accepting pointers. A node is not allowed to point to the same neighbor with two pointers; furthermore, the number of demanding pointers a node uses must be strictly greater than the number of accepting pointers it uses (the latter can be [math]). The number of accepting pointers a node is allowed to use is bounded from above by . The output is correct if for each demanding pointer from a node to a node , the two nodes have different colors or has an accepting pointer pointing to . For an illustration, see Figure 2.
Formally, we define superweak -coloring as follows, again using that we are only interested in regular graphs (since our lower bound will hold already for regular graphs). Set
[TABLE]
Problem
Now we examine how the derived problem looks like, if we define to be superweak -coloring. We obtain . Furthermore, consists of all multisets such that A) each is a subset of , and B) there are some and some partition such that if , if , if , , and . Similar to the case of being derived from weak -coloring, observe that for each multiset contained in , it holds that if , then both and are not contained in , whereas is always contained in both and for all . Again taking the maximality constraint of Property 5 into account, we obtain that contains exactly the multisets with the property that, for each , we have
[TABLE]
where and denote the subset of and , respectively, consisting of all elements where the first entry is .
An Equivalent Description
Analogously to our approach in the case of being weak -coloring, we can give a different description of the same problem: Define to be the set of all trit sequences of length . Define to consist of all multisets of cardinality of trit sequences of length such that there is an index such that, in the multiset, the number of trit sequences that have a at position is strictly greater than the number of trit sequences that have a [math] at position , and there are at most trit sequences that have a [math] at position . Finally, define to be the set of multisets of trit sequences of length such that the tritwise addition of and yields the trit sequence .
In order to see that this new description is equivalent to the old one, we can use the same argumentation as in the analogous case: essentially, we map to a [math] at position , to a , and to a , and then observe that the constraints are the same in both descriptions.
Problem
Unavoidably, the next step is to derive from . We obtain . Furthermore, consists of all multisets such that A) are sets of trit sequences of length , and B) there exist elements such that the tritwise sum of and is . Finally, consists of all multisets of sets of trit sequences of length such that A) for any , there is an index such that the number of that have a at position is strictly greater than the number of that have a [math] at position , and there are at most many that have a [math] at position , and B) the multiset is maximal with Property A), i.e., if we add a new trit sequence of length to some arbitrary in the multiset, then the multiset does not satisfy Property A) anymore.
The following lemma highlights a structural aspect of the elements in .
Lemma 1**.**
Let be an arbitrary multiset from , and assume that . Then there is a unique element that has multiplicity at least and contains the trit sequence .
Proof.
Let be the set of all elements (i.e., of all sets of trit sequences of length ) of that occur in with multiplicity at least . Set . Let be the multiset obtained by replacing in all occurrences of , except exactly many, by , for each . Then the multiplicity of each in is exactly . We claim that .
For a contradiction, assume that . Since (and for any ), it is not possible that violates only Property B) in the description of . Hence, violates Property A) in that description, which implies that there are such that, for each index , the number of (where ) that have a [math] at position is greater than or at least as large as the number of that have a at position .
For each , let be a trit sequence with multiplicity at least in the multiset . Since each has multiplicity in (and the number of distinct trit sequences of length is ), such exist, by the pidgeonhole principle. Now consider trit sequences specified as follows:
First, for each index that satisfies and for all , set . Then, for each trit sequence that has multiplicity in , first compute where is the number of already set to in the first step, then pick both an arbitrary containing and many so far unused indices with , and set for these indices. Then, for each trit sequence that has multiplicity in , pick both an arbitrary containing and many so far unused indices with , and set for these indices. Finally, for each remaining unused index , which must necessarily satisfy for some , set (where is as specified above).
Since each has multiplicity in , such exist, again by the pidgeonhole principle. Consider the two multisets and . Observe that each trit sequence of length has either multiplicity at least in both multisets, or the same multiplicity in both multisets. Hence, for each , the number of that have a [math] at position is the same as the number of that have a [math] at position , or both numbers are at least . For analogous reasons, the statement still holds if we replace each [math] by a . Taking a close look at Property A), we conclude that if Property A) fails for due to choosing , then Property A) also fails for due to choosing . Since we already established the former, we obtain a contradiction to . Thus, .
As for each and , we see that , by Property B) in the description of . Hence, there is at most one element in with multiplicity greater than . Since contains at most distinct elements, it follows that there is a unique element with multiplicity at least141414Note that, for the sake of simplicity, we do not attempt to prove the sharpest bounds possible, but rather bounds that are convenient to work with. , due to and our assumption that .
In order to see that contains , assume for a contradiction that it does not, and define to be the multiset obtained by adding the trit sequence to one of the contained in , say to . Analogously to the above, we can infer that violates Property A) and consider a violating choice . We must have in that violating choice; if we now replace by some trit sequence contained in that has multiplicity at least in the multiset (which must exist due to the pidgeonhole principle), we obtain a choice that ensures that violates Property A), as can be shown using analogous arguments as above. This, again, yields a contradiction to , thereby proving that contains . ∎
Note that the element in Lemma 1 depends on the choice of . If we want to specify the respective , we write instead of .
The following technical lemma provides us with two index sets and (depending on the local output of an algorithm for ) with desirable properties that we will use in Lemma 3 to infer where to output demanding and accepting pointers (when transforming an algorithm for into an algorithm for superweak -coloring). Curiously, our proof of Lemma 2 requires the application of Hall’s marriage theorem [19].
Lemma 2**.**
Let be an arbitrary multiset from , and assume that . Let be a function that maps each to an element of . Let be the subset of indices satisfying and151515The statement that already implies that , by Lemma 1, but let us be very explicit here, for convenience. . For any , let denote the subset of indices such that there exists an index such that and . Then there is some such that and for any , we have .
Proof.
Consider the bipartite (unlabeled) graph defined as follows. One side of the bipartition is given by a node for each index ; the other side by a node for each index . Deviating from our conventions, we denote the set of by and the set of by ; furthermore, for each subset and each subset , we set and , respectively. The edge set is given by creating an edge between and if and only if and . In other words, is the set of neighbors of some set in if and only if . We denote the set of neighbors of a set , resp. , in by , resp. .
We start proving the lemma by showing that there exists some set such that . Assume for a contradiction that no such set exists, i.e., for each subset , we have . This implies that for each subset , we have . Now we can apply Hall’s marriage theorem [19, Theorem 1] that states that in bipartite graphs with the bipartition given by the two node sets and , the above condition is sufficient (and also necessary) for the existence of a matching that covers all nodes in . Let be such a matching. Hence, for each , there is an edge with . Note that no node can be matched to in since there is no edge in connecting to . We say that two edges are touching if there is some such that both and have an endpoint in .
Our next goal is to find a matching with the property that for each , exactly one of and is matched. To this end, find a partition such that two edges are contained in the same if and only if there exists a sequence of edges such that and are touching for all . Clearly, such a partition exists, and it is easy to see that this partition is unique up to reordering of the . Observe that the definition of touching edges implies that for each edge , there are at most two other edges such that and are touching. It follows, due to the definition of our partition and , that each of the has exactly one of the two following forms:
There is a sequence of distinct indices from such that , , and node is not matched in . 2. 2.
There is a sequence of distinct indices from such that .
We say that that fall into the first category are pathlike, and that from the second category are ringlike. Note that for pathlike , the choice of the sequence of indices is unique, whereas for ringlike , the choice is only unique up to certain reorderings. In the latter case, fix an arbitrary one of those sequences. We observe that for ringlike , the parameter is even: As the edges contained in are also elements of , we have , by the definitions of and . Since , we obtain , which implies that must be even.
In order to find our desired matching , we will describe for each which of the edges contained in are chosen as elements of . For both pathlike and ringlike , choose the edges where is odd into . We argue that the obtained matching has indeed the property that for each , exactly one of and is matched: Consider an arbitrary . Either both and are matched in our initial matching , in which case the two edges from responsible for this are in the same and exactly one of the two edges is chosen into ; or only is matched in , in which case the edge from responsible for this must be the edge in some pathlike and this edge is chosen into whereas remains unmatched also in .
Now we will use our matching to find a choice that violates Property A) in the description of . More specifically, we choose our as follows.
First, for each edge , choose such that the tritwise sum of and is . This is possible, as , due to the definition of and the fact that . Moreover, since is a matching with the property described above, the above rule fixes each at most once, and it fixes for all . Second, consider all that satisfy . Due to the definition of , all these satisfy . For each such , find a partner index such that , and all these are distinct and have not been used in the above (i.e., is not fixed yet). Since, by Lemma 1, has a multiplicity of at least in , this is possible. Now, for each such , choose such that the tritwise sum of and is (which, again, is possible due to ). Finally, for each for which is not fixed yet, set (which is possible as all leftover satisfy ).
Since each is equal to or has a unique partner such that the tritwise sum of the two is , the above choice of the ensures that, for each , the number of that have a [math] at position is exactly the same as the number of that have a at position . By the definition of , it follows that , yielding a contradiction. Hence, there exists some set such that .
In order to use to obtain a set as specified in the lemma, partition into two sets and , where and . Consider and . By the definition of the function , each has to satisfy , and each has to satisfy . Since , it follows that is actually the disjoint union of and . Hence, we have , which implies that or .
Now, if , set , otherwise set . By our above observations, we conclude that satisfies the properties stated in the lemma. ∎
The following lemma shows that superweak -coloring is a relaxation of our problem derived from superweak -coloring, if is sufficiently large compared to . The proof relies heavily on Lemma 2.
Lemma 3**.**
Let be superweak -coloring for some , and let be an arbitrary non-negative integer. Let be a graph class such that every contained graph is regular with (maximum) degree at least and the given input includes an orientation for each edge. Set . If there is an algorithm that solves in time , then there is also an algorithm that solves superweak -coloring on in time .
Proof.
Let be an algorithm that solves in time . We will construct an algorithm that solves superweak -coloring on in time as follows. For executing each node first executes and then applies a function that takes the obtained outputs at all , where is incident to , and the input information and port numbers knows from the very beginning as input and returns new outputs for all the . As applying the function requires no communication between the nodes, has the same runtime as . In the following we describe the function, i.e., how transforms an arbitrary output for (at all ) into an output for superweak -coloring.
Let and , and denote the maximum degree of by . Let denote the multiset that contains, for each edge incident to , the output of at . Since solves , we know that . Let denote the edges incident to according to the respective port numbers at , and assume w.l.o.g. that is the output at . We start by assigning a value to each index by setting if edge is oriented towards , and if edge is oriented away from . Define values , by setting if , and if . Let denote the multiset . Let be the set of all possible , i.e., .
Recall that each output for the problem of superweak -coloring is a pair . Let denote the output will assign to according to . For the first entry , we find an arbitrary injective function in some fixed deterministic way (i.e., all nodes use the same function), and then set for all incident to . Since each element is a multiset of cardinality in which, by Lemma 1, all contained sets together have multiplicity at most , and since each contained set is a set of trit sequences of length (of which there are distinct ones), there are most distinct elements in , by the definition of . Hence, an injective function as described above exists.
For the second entry , node computes a set as described in Lemma 2 (and whose existence is guaranteed by Lemma 2) in a fixed deterministic way. We require that the multiset does not depend on the order of the pairs fixed by the port numbers, i.e., that any two nodes with obtain the same multiset by their choice of . However, since whether a set has the properties described in Lemma 2 depends only on the multisets and (due161616Note that, for the first equality, we use the fact that, for all , we have since all indices satisfy whereas , by Lemma 1. to the definition of in Lemma 2), such a choice for exists. Now, set if , if , and if . Note that , by Lemma 2.
We argue that the algorithm specified by the above indeed solves superweak -coloring on . Clearly outputs the same color at each according to . Moreover, the number of demanding pointers that outputs is strictly larger than the number of accepting pointers since , by Lemma 2; also, by Lemma 1, the latter number is at most since, for each , we have , by the definition of in Lemma 2 and the fact that . What is left to show is that there is no conflict on an edge, i.e., that for each edge , and output different colors at and , or both output , or at least one of them outputs .
Hence, consider the case that and output the same color at and , and that one of the two, say , also outputs at . By the definition of , this implies that the multisets and are equal (due to the injectivity of the above function), and that the set that outputs at satisfies , where is the index set computed by . Let be the port number for at , and the port number for at , and add the sub- or superscript , resp. , to the usual definitions to specify which node they correspond to. As (by the correctness of ) and , we see that there is some index with and , due to . Since (due to and the specified way of choosing ), there must be an index such that and . Observe that there cannot be two indices with and , due to the definition of the function . Hence, we obtain , which implies that outputs at , by the definition of . ∎
Due to our speedup results in Section 4, the following speedup lemma for superweak -coloring is essentially a corollary of Lemma 3. As the setting for weak -coloring used by Naor and Stockmeyer includes unique IDs, we formulate Lemma 4 for order-invariant algorithms. An analogous speedup lemma can be achieved for general algorithms in a setting without unique IDs.
Lemma 4**.**
Let , and fix some . Let be a -input-labeled graph class such that every contained graph has nodes, is -regular and the given input includes an orientation for each edge. Assume that is -independent for some positive integer and contains only graphs of girth at least . Let be a finite set of identifiers satisfying . Let be the class consisting of all (input-labeled) graphs obtained by taking a graph from and assigning unique identifiers from as inputs to the nodes of the graph. Set . If there is an order-invariant algorithm solving superweak -coloring on in time , then there is also an order-invariant algorithm solving superweak -coloring on in time .
Proof.
Set to be superweak -coloring and let be an order-invariant algorithm solving in time . Then, there is an order-invariant algorithm solving in time , by Theorem 3. It follows that there is also an algorithm solving superweak -coloring on in time , by Lemma 3. The order-invariance of follows from the proof of Lemma 3, or, more precisely, from the fact that , as defined in that proof, only takes the outputs of (the order-invariant algorithm) and the initial input information into account. ∎
5.2 Proving the Lower Bound
In this section, we show that there is no algorithm solving weak -coloring in time in odd-degree graphs in the setting used by Naor and Stockmeyer [24], or in the model [22, 25]. Those two models differ from the port numbering model we use in that they provide each node with a globally unique ID, where the model (commonly) uses IDs that are -bit strings while Naor and Stockmeyer assume arbitrarily large IDs. As the setting with a bound on the size of the IDs, i.e., the model, clearly makes proving a lower bound harder, we will formally prove our lower bound for the model.
As mentioned in Section 2.2, Naor and Stockmeyer themselves provide a tool to circumvent the complications arising from unique IDs: as they show in their work, if there is a constant-time algorithm for some problem, then there is also an order-invariant algorithm for the same problem with the same runtime. We will make use of this fact in the proof of Theorem 4.
Theorem 4**.**
There is no -time algorithm solving weak -coloring in odd-degree graphs.
Proof.
Assume for a contradiction that such an -time algorithm exists, and, for each odd , let denote the (worst-case) runtime of on -regular graphs. We can assume w.l.o.g. that for all (by choosing our algorithm suitably). Fix to be an odd positive integer such that (such a must exist since ).
Let be the (-input-labeled) graph class consisting of all -regular graphs with nodes, girth at least , and arbitrary edge orientations as inputs, where is a sufficiently large even constant (in particular, the ID space, which depends on , has to be of cardinality at least since we want to apply Lemma 4). Let be the class consisting of all graphs obtained by taking a graph from and assigning unique -bit IDs as inputs to the nodes of the graph. Let be the graph class obtained from by removing the edge orientations from the input (as those are not part of the model). We will show that our lower bound already holds on . Note that the three defined graph classes are all non-empty as assured, e.g., by [8, Chapter III, Theorem ].
Since, by assumption, solves weak -coloring on in time , and since is a fixed constant, there must also be an order-invariant algorithm doing the same, by [24, Theorem 3.3] (or, more precisely, by the proof thereof). Hence, in the following assume that is order-invariant. Clearly, our order-invariant algorithm also solves weak -coloring on in time , by simply ignoring the additional input information. As argued in Section 4.6, any algorithm that solves weak -coloring can be transformed into an algorithm that solves the pointer version of weak -coloring and requires at most one additional round. It is straightforward to check that this transformation also preserves order-invariance; for simplicity, denote the algorithm obtained after the transformation also by . Moreover, any algorithm that solves the pointer version of weak -coloring also solves superweak -coloring, by the definition of the latter. Hence, it follows that solves superweak -coloring on in time .
Observe that the definition of our graph class ensures that is -independent. Hence we can use Lemma 4 to repeatedly speed up until we obtain an algorithm that solves superweak -coloring for some large in [math] rounds. Set , and define recursively , where . Since for all and any algorithm for superweak -coloring also solves superweak -coloring if , we see that applications of Lemma 4 result in a [math]-round algorithm that solves superweak -coloring on , where . Note that for all , which implies that the condition in Lemma 4 is satisfied in each application of the lemma. In the following we argue that such a [math]-round algorithm cannot exist.
Consider a node in some graph from , and assume that has only incoming edges at the first ports and only outgoing edges at the remaining ports. Let and be two IDs such that would output the same color according to if it has either of the two IDs as input. Such two IDs must exist by the pidgeonhole principle. Now assume that is given as input, and let be some edge incident to such that outputs at according to . Assume that has the same constraint on the incoming and outgoing edges as (which is possible due to our choice of ) and let be the input ID for , which implies that outputs the same color as . Moreover, since (due to ), there are at least one edge incoming at and one edge outgoing at such that outputs neither at nor at (since has at most pointers available). Clearly, there exists a port numbering at such that or . In this case, and output the same color, outputs at , and does not output at , which yields an incorrect output for superweak -coloring at . Hence, there is no -time algorithm as described in the theorem statement. ∎
6 Conclusion
In this work, we developed a new technique for determining (or at least bounding) the time complexity of a given distributed problem, based on the idea of automatically transforming the problem into a problem that can be solved exactly one round faster. We proved the viability of the technique by showing that it can be used for reproducing known results in a (semi-)automatic fashion and that it is powerful enough to facilitate answering a long-standing open question that has resisted proof attempts for 25 years. Given that the technique can be applied to any locally checkable problem, we expect many other problems to be solved by this technique. A first confirmation has already been given by the follow-up work of Balliu et al. [2], where the authors use our speedup technique to prove lower bounds for maximal matching and maximal independent set.
One main difficulty in applying our speedup technique lies in the fact that the complexity of the problem description increases substantially in each speedup step. We provided a simplification technique (maximization) and outlined two general approaches (relaxation for a lower bound, making a problem harder for an upper bound) that mitigate this problem. Are there other techniques for reducing the description complexity? Is there a good way to determine which parts of the description of a derived problem are important and which can be discarded? Is finding the correct lower bound only a matter of finding the right relaxation (and to which degree can this be automated)? Answering these questions would be an important step forward in establishing the presented speedup as a main technique for determining the time complexity of locally checkable problems.
Acknowledgments
I would like to thank Alkida Balliu, Dennis Olivetti, and Jukka Suomela for insghtful discussions about weak coloring.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Dana Angluin. Local and global properties in networks of processors (extended abstract). In Proceedings of the 12th Annual ACM Symposium on Theory of Computing, April 28-30, 1980, Los Angeles, California, USA , pages 82–93, 1980.
- 2[2] Alkida Balliu, Sebastian Brandt, Juho Hirvonen, Dennis Olivetti, Mikaël Rabie, and Jukka Suomela. Lower bounds for maximal matchings and maximal independent sets. Co RR , abs/1901.02441, 2019.
- 3[3] Alkida Balliu, Sebastian Brandt, Dennis Olivetti, and Jukka Suomela. Almost global problems in the LOCAL model. In 32nd International Symposium on Distributed Computing, DISC 2018, New Orleans, LA, USA, October 15-19, 2018 , pages 9:1–9:16, 2018.
- 4[4] Alkida Balliu, Juho Hirvonen, Janne H. Korhonen, Tuomo Lempiäinen, Dennis Olivetti, and Jukka Suomela. New classes of distributed time complexity. In Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2018, Los Angeles, CA, USA, June 25-29, 2018 , pages 1307–1318, 2018.
- 5[5] Alkida Balliu, Juho Hirvonen, Dennis Olivetti, and Jukka Suomela. Hardness of minimal symmetry breaking in distributed computing. Co RR , abs/1811.01643, 2018.
- 6[6] Leonid Barenboim and Michael Elkin. Distributed (delta+1)-coloring in linear (in delta) time. In Proceedings of the 41st Annual ACM Symposium on Theory of Computing, STOC 2009, Bethesda, MD, USA, May 31 - June 2, 2009 , pages 111–120, 2009.
- 7[7] Leonid Barenboim and Michael Elkin. Deterministic distributed vertex coloring in polylogarithmic time. In Proceedings of the 29th Annual ACM Symposium on Principles of Distributed Computing, PODC 2010, Zurich, Switzerland, July 25-28, 2010 , pages 410–419, 2010.
- 8[8] Béla Bollobás. Extremal Graph Theory . 1978.
