Graph Reconstruction in the Congested Clique
Pedro Montealegre, Sebastian Perez-Salazar, Ivan Rapaport, Ioan, Todinca

TL;DR
This paper studies the problem of reconstructing graphs in the congested clique model, proving tight bounds on communication complexity and presenting algorithms that achieve these bounds in one or two rounds.
Contribution
It introduces optimal algorithms for graph reconstruction in the congested clique model, matching lower bounds with minimal rounds of communication.
Findings
Optimal two-round algorithm for general graph classes.
One-round algorithm for hereditary graph classes.
Communication complexity matches the lower bound.
Abstract
The congested clique model is a message-passing model of distributed computation where the underlying communication network is the complete graph of nodes. In this paper we consider the situation where the joint input to the nodes is an -node labeled graph , i.e., the local input received by each node is the indicator function of its neighborhood in . Nodes execute an algorithm, communicating with each other in synchronous rounds and their goal is to compute some function that depends on . In every round, each of the nodes may send up to different -bit messages through each of its communication links. We denote by the number of rounds of the algorithm. The product , that is, the total number of bits received by a node through one link, is the cost of the algorithm. The most difficult problem we could attempt to solve is the reconstruction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Graph Reconstruction in the Congested Clique
Pedro Montealegre111Facultad de Ingeniería y Ciencias, Univ. Adolfo Ibáñez, Santiago, Chile, [email protected]
Sebastian Perez-Salazar222DIM-CMM (UMI 2807 CNRS), Univ. de Chile, Santiago, Chile, {sperez,rapaport}@dim.uchile.cl
Ivan Rapaport††footnotemark:
Ioan Todinca333Univ. Orléans, INSA Centre Val de Loire, LIFO EA 4022, Orléans, France, [email protected]
Abstract
The congested clique model is a message-passing model of distributed computation where the underlying communication network is the complete graph of nodes. In this paper we consider the situation where the joint input to the nodes is an -node labeled graph , i.e., the local input received by each node is the indicator function of its neighborhood in . Nodes execute an algorithm, communicating with each other in synchronous rounds and their goal is to compute some function that depends on . In every round, each of the nodes may send up to different -bit messages through each of its communication links. We denote by the number of rounds of the algorithm. The product , that is, the total number of bits received by a node through one link, is the cost of the algorithm.
The most difficult problem we could attempt to solve is the reconstruction problem, where nodes are asked to recover all the edges of the input graph . Formally, given a class of graphs , the problem is defined as follows: if , then every node must reject; on the other hand, if , then every node must end up, after the rounds, knowing all the edges of . It is not difficult to see that the cost of any algorithm that solves this problem (even with public coins) is at least , where is the subclass of all -node labeled graphs in . In this paper we prove that previous bound is tight and that it is possible to achieve it with only rounds. More precisely, we exhibit (i) a one-round algorithm that achieves this bound for hereditary graph classes; and (ii) a two-round algorithm that achieves this bound for arbitrary graph classes. Later, we show that the bound cannot be achieved in one-round for arbitrary graph classes, and we give tight algorithms for that case.
From (i) we recover all known results concerning the reconstruction of graph classes in one round and bandwidth : forests, planar graphs, cographs, etc. But we also get new one-round algorithms for other hereditary graph classes such as unit disc graphs, interval graphs, etc. From (ii), we can conclude that any problem restricted to a class of graphs of size can be solved in the congested clique model in two rounds, with bandwidth . Moreover, our general two-round algorithm is valid for any set of labeled graphs, not only for graph classes (which are sets of labeled graphs closed under isomorphims).
1 Introduction
The congested clique model –a message-passing model of distributed computation where the underlying communication network is the complete graph [20]– is receiving increasingly more attention [4, 6, 7, 8, 11, 12, 13, 15, 18, 22]. There are deep connections between the congested clique model and popular distributed systems such as the -machine model [17] or MapReduce [14]. Moreover, with the emergence of large-scale networks, this model has started to be used in other areas such as distributed convex learning [1].
The congested clique model is defined as follows. There are nodes which are given distinct identities (IDs), that we assume for simplicity to be numbers between 1 and . In this paper we consider the situation where the joint input to the nodes is a graph . More precisely, each node receives as input an -bit boolean vector , which is the indicator function of its neighborhood in . Note that the input graph is an arbitrary -node graph, a subgraph of the communication network .
Nodes execute an algorithm, communicating with each other in synchronous rounds and their goal is to compute some function that depends on . In every round, each of the nodes may send up to different -bit messages through each of its communication links. When an algorithm stops every node must know . We call the output of the distributed algorithm. The parameter is known as the bandwidth of the algorithm. We denote by the number of rounds. The product represents the total number of bits received by a node through one link, and we call it the cost of the algorithm.
An algorithm may be deterministic or randomized. We distinguish two sub-cases of randomized algorithms: the private-coin setting, where each node flips its own coin; and the public-coin setting, where the coin is shared between all nodes. An -error algorithm that computes a function is a randomized algorithm such that, for every input graph , . In the case where as , we say that computes with high probability (whp).
Function defines the problem to be solved. A function corresponds to a decision problem (such as connectivity [13]). For other, more general types of problems, should be defined, in fact, as a relation. This happens, for instance, when we want to construct a minimum spanning tree [12], a 3-ruling set [15], all-pairs shortest-paths [6], etc.
The most difficult problem we could attempt to solve is the reconstruction problem, where nodes are asked to reconstruct the input graph . In fact, if at the end of the algorithm every node has full knowledge of , then it could answer any question concerning . (This holds because in the congested clique model nodes have unbounded computational power and the only cost is related to communication).
In centralized, classical graph algorithms, a widely used approach to cope with NP-hardness is to restrict the class of graphs where the input belongs. Consider, for instance, the coloring problem, where the goal is to determine the minimum number of colors that we can assign to the vertices of such that no two vertices sharing the same edge have the same color [10]. It is known that, if the input is restricted to the class of interval graphs, the coloring problem is polynomial [10]. Nevertheless, if we restrict it to planar graphs, the problem remains NP-complete [10]. We are going to use the same approach here, in the congested clique model. But, as we are going to explain later, surprisingly, the complexity of the reconstruction problem will only depend on the cardinality of the subclass of -node graphs in .
Formally, for any fixed set of graphs we are going to introduce two problems. The first one, the strong recognition problem -Strong-Rec, is the following.
Input:
An arbitrary graph
Output:
-Strong-Rec parameterized by
Recall that the output is computed by every node of the network. In other words, every node of an algorithm that solves -Strong-Rec must end up knowing whether belongs to ; and, in the positive cases, every node also finishes knowing all the edges of . Note that, in principle, could be defined as the set of all graphs.
We also define a weak recognition problem -Weak-Rec. This is a promise problem, where the input graph is promised to belong to . In other words, for graphs that do not belong to , the behavior of an algorithm that solves -Weak-Rec does not matter.
Input:
Output:
all the edges of
-Weak-Rec parameterized by
For any positive integer we define as the set of -node graphs in . There is an obvious lower bound for , even for the weak reconstruction problem -Weak-Rec and even in the public-coin setting. In fact, . This can be easily seen if we note that, in the randomized case, there must be at least one outcome of the coin tosses for which the correct algorithm reconstructs the input graph in at least of the cases. Therefore, . The value corresponds to the total number of bits received by any node of the network: bits are received from the other nodes and bits are known by at the beginning of the algorithm (this is the indicator function of its neighborhood). This implies that . In this paper we are going to prove that this bound is tight even with (if is an hereditary class of graphs) and (in the general case).
We point out that our reconstruction algorithms may be applied not only to itself but also to some subgraph of . For instance, consider the situation where we generate a new graph by performing (locally) a random sampling on the edges of . Since typically belongs to a smaller class of graphs (whp), reconstructing may result in an efficient strategy to infer some properties of [26].
1.1 Our Results
We start this paper by studying a very natural family of graph classes known as hereditary. A class is hereditary if, for every graph , every induced subgraph of also belongs to . Many graph classes are hereditary: forests, planar graphs, bipartite graphs, -colorable graphs, bounded tree-width graphs, -degenerate graphs, etc. [5]. Moreover, any intersection class of graphs –such as interval graphs, chordal graphs, unit disc graphs, etc.– is also hereditary [5].
In Section 3 we give, for every hereditary class of graphs , a one-round private-coin randomized algorithm that solves -Strong-Rec with bandwidth
[TABLE]
We emphasize that our algorithm runs in one-round, and therefore it runs in the broadcast congested clique, a restricted version of the congested clique model where, in every round, the messages sent by a node must be the same. (This equivalence will be explained in Section 2). We also remark that for many hereditary graph classes, including all classes listed above, our algorithm is tight. Moreover, our result implies that -Strong-Rec can be solved in one-round with bandwidth when is the class of forests, planar graphs, interval graphs, unit-circle graphs, or any other hereditary graph class such that .
In Section 4 we give a very general result, showing that two rounds are sufficient to solve -Strong-Rec in the congested clique model, for any set of graphs . More precisely, we provide a two-round deterministic algorithm that solves -Weak-Rec and a two-round private-coin randomized algorithm that solves -Strong-Rec whp. We also give a three-round deterministic algorithm solving -Strong-Rec. All algorithms run using bandwidth , so they are asymptotically optimal when .
Our result implies, in particular, that -Strong-Rec can be solved in two rounds with bandwidth , when is any set of graphs of size . The only property of the set of graphs used by our algorithm is the cardinality of . Our algorithm does not require to be closed under isomorphisms.
In Section 5 we revisit the one-round case. We show that our general algorithm can be adapted to run in one round (i.e., in the broadcast congested clique model) by allowing a larger bandwidth, and then we show that this is tight. More precisely, we show that, for every set of graphs , there is a one-round deterministic algorithm that solves -Weak-Rec, and a one-round private-coin algorithm that solves -Strong-Rec whp, both of them using bandwidth .
Then we show that there are classes of graphs satisfying that such that every algorithm (deterministic or randomized) that solves -Weak-Rec in the broadcast congested clique model has cost . Therefore, with respect to the bandwidth, our general one-round algorithms for solving -Weak-Rec and -Strong-Rec are tight (up to a logarithmic factor).
Our one-round algorithm that solves -Strong-Rec uses private coins. Is it possible to achieve the same deterministically? Our last result gives a negative answer to this question. Consider, for a set of graphs , the recognition problem -Recognition, which consists in deciding whether the input graph belongs to . We show that there exists a set of graphs , satisfying , such that any one-round deterministic algorithm that solves -Recognition requires bandwidth . Clearly, the same lower-bound is valid for any deterministic algorithm that solves -Strong-Rec. This is far from our bandwidth .
1.2 Related Work
All known results concerning the reconstruction of graphs have been obtained in the context of hereditary graph classes. For instance, let be the class of cograph, that is, the class of graphs that do not contain the 4-node path as an induced subgraph. This class is obviously hereditary. In [16], the authors presented a one-round public-coin algorithm that solves -Strong-Rec with bandwidth . Note that . Therefore, the result we get in this paper is stronger, because our one-round algorithm needs the same bandwidth but uses private coins.
In [3, 21] it is shown that, if is the class of -degenerate graphs, then there is a one-round deterministic algorithm that solves -Strong-Rec with bandwidth . A graph is -degenerate if one can remove from a vertex of degree at most , and then proceed recursively on the resulting graph , until obtaining the empty graph. Note that planar graphs (or more generally, bounded genus graphs), bounded tree-width graphs, graphs without a fixed graph as a minor, are all -degenerate, for some constant . Since the class of -degenerate graphs is hereditary and satisfies , it follows, from this paper, the existence of a one-round private-coin randomized algorithm that solves -Strong-Rec with bandwidth . However, the result of [3] for this particular class is stronger, since their algorithm is deterministic.
Another example of reconstruction with one-round algorithms can be found in [8]. There, the authors consider the class of graphs defined by one forbidden subgraph . They show that such classes can be reconstructed deterministically with cost , where is the Turán number, defined as be the maximum number of edges in an -node graph not containing an isomorphic copy of as a subgraph. For example, if is the cycle of length 4, then . This implies that, if we define as the class of graphs not containing as a subgraph, then there is a one-round deterministic algorithm that solves -Strong-Rec with bandwidth .
2 Preliminaries
2.1 Some Graph Terminology
Two graphs and are isomorphic if there exists a bijection such that any pair of vertices are adjacent in if and only if and are adjacent in . A class of graphs is a set of graphs which is closed under isomorphisms, i.e., if belongs to and is isomorphic to , then also belongs to . For a class of graphs and , we call the subclass of -node graphs in .
For a graph and we denote the subgraph of induced by . More precisely, the vertex set of is and the edge set consists of all of the edges in that have both endpoints in . A class of graphs is hereditary if it is closed under taking induced subgraphs, i.e., for every and every , the induced subgraph .
For a graph , we call its adjacency matrix, i.e., the 0-1 square matrix of dimension where if and only if is adjacent to . Let be a square matrix of dimension , and let . We call the -th row of . Let be another square matrix of dimension . We denote by the row-distance between and , that is, the number of rows that are different between and . In other words, . For and , let us call the set of all graphs such that .
2.2 One-Round Algorithms in the Congested Clique
The broadcast congested clique is a restricted version of the congested clique model where each node is forced, in each round, to send the same message through its communication links. But, if we consider one-round algorithms, the two models are the same. In fact, suppose that there is a one-round algorithm (deterministic or randomized) in the congested clique with bandwidth . We can transform it into an algorithm in the broadcast version with bandwidth as follows. We fix a vertex, say the one with ID , and every node broadcasts the message it would send to node on algorithm , plus one bit indicating whether node and node are adjacent in . After this communication round of , every node knows the messages node would have received after the communication round of algorithm . Moreover, every node knows the neighborhood of node . The result follows from the fact that, with this information, node 1 knows the output. Obviously, as we will see in this paper, when multi-round algorithms are considered, the broadcast congested clique model is much less powerful than the congested clique model.
2.3 Fingerprints
The following technique, that we call fingerprints, is based on a result known as the Schwartz Zippel Lemma, used in verification of polynomial identities [25]. Let be a positive integer and be a prime number. In the following, we denote by the finite field of size (we refer to the book of Lidl and Niederreiter [19] for further details and definitions involving finite fields). A polynomial of degree is an expression of the form , where and for each . We denote by the polynomial ring on . An element is called a root of a polynomial if .
Let be a positive integer, and be two prime numbers such that . For each and , consider the polynomial defined as
[TABLE]
For , we call the fingerprint of and . Note in the last expression that the coordinates of are interpreted as elements of . The following lemma is direct. Since the proof is very short we include it here.
Lemma 1
[19]** Let be a positive integer, and be two prime numbers such that . Let such that . Then,
**Proof ** Note that implies that . Since is a polynomial of degree at most in , it has at most roots in . Therefore .
We extend the definition of fingerprints to matrices. Let be a square matrix of dimension and coordinates in , and let be an element of . We call the fingerprint of and , defined as , where is the -th row of , for each . Moreover, for a graph of size , and we call the fingerprint of and .
3 Reconstructing Hereditary Graph Classes in One Round
In this section we start giving the positive result. Later we explain the consequence of this result on well-known hereditary graph classes.
Theorem 1
Let be an hereditary class of graphs. There exists a one-round private-coin algorithm that solves -Strong-Rec whp and bandwidth .
**Proof ** In the algorithm, nodes use a prime number , whose value will be chosen later. The algorithm consists in: (1) Each node picks in uniformly at random (using private coins), and computes . (2) Each node communicates and . (3) Every node constructs and from the messages sent in the communication round. Finally: (4) Every node looks in for a graph such that . If such graph exists, the algorithm outputs , otherwise it rejects. The description of the algorithm is given in Algorithm 1.
Now we aim to show that, if satisfies , then whp. Let in , picked uniformly at random. Then,
[TABLE]
Suppose that and let such that belongs to . Then, from Lemma 1, we deduce that . It follows that
[TABLE]
We now claim that . Indeed, we can interpret a graph in as a graph built by picking vertices of and then adding or removing edges between those vertices. Since we are looking for graphs in , and is hereditary, the graph induced by must belong to . Therefore, . This claim implies that
[TABLE]
Let be defined as . Note that this function is increasing, satisfies , and . Therefore,
[TABLE]
We now fix as the smallest prime number greater than , and we deduce that
[TABLE]
Then, with probability at least , either or , for every . Hence, the algorithm solves -Strong-Rec whp.
Note that the bandwidth required by node in the algorithm equals the number of bits required to represent the pair , which are two integers in . Therefore, the bandwidth of the algorithm is
[TABLE]
Corollary 1
Let be an hereditary class of graphs, and be an increasing function such that . Then, our private-coin algorithm solves -Strong-Rec whp, in one-round, with bandwidth . This matches the lower bound on the cost (which must be satisfied even in the public coin setting).
**Proof ** We simply note the existence of constants such that:
[TABLE]
Therefore, the algorithm of Theorem 1 uses bandwidth .
In [24], Scheinerman and Zito showed that hereditary graph classes have a very specific growing rate. They showed ([24], Theorem 1) that, for any hereditary class of graphs , one of the following behaviors must hold:
- •
is constant; meaning that for all sufficiently large.
- •
is polynomial, meaning that .
- •
is *exponential *, meaning that .
- •
is *factorial *, meaning that .
- •
is *super-factorial *, meaning that .
Corollary 1 implies that our algorithm is tight for any factorial hereditary class of graphs. For example, the class of forests, planar graphs, interval graphs, unit disc graphs, circle graphs, etc., are factorial. Therefore, the bandwidth required to reconstruct them in one-round is . Moreover, constant, polynomial and exponential hereditary classes can be also reconstructed with bandwidth .
Super-factorial hereditary classes of graphs might be more troublesome. Indeed, in [2] it is shown that there exist super-factorial hereditary classes such that the succession might oscillate, roughly, between and , for two constants . For these classes, the upper bound given by our algorithm does not match the lower bound . We remark, however, that there are also super-factorial classes of graphs where our algorithm is non-trivial and tight. For example, if is the class of chordal-bipartite graphs, we have that . Therefore, they can be reconstructed in one-round with bandwidth .
4 Reconstructing Arbitrary Graph Classes in Two Rounds
In this section we show that there exists a two-round private-coin algorithm in the congested clique model that solves -Strong-Rec whp and bandwidth . Our algorithm is based, roughly, on the same ideas used to reconstruct hereditary classes of graphs. But the problem we encounter is the following: while in the case of hereditary classes of graphs, we had for every graph and , a bound on the number of graphs contained in , this is not the case in an arbitrary family of graphs . Therefore, fingerprints alone are not able to differentiate graphs. To cope with this obstacle, we use Error Correcting Codes.
4.1 Error Correcting Codes
Consider the following technique, introduced by Reed and Solomon [23], originally used to produce safe communication in a noisy channel. (This technique has also been used in randomized protocols for multiparty communication complexity [9]).
Definition 1
Let , and let be the smallest prime number greater that . An error correcting code with parameters is a mapping , satisfying:
For every and , .
- 2)
For each , implies .
For sake of completeness, we give the construction of an error correcting code with parameters . For , let be the unique polynomial in satisfying for each . The function is then defined as . This function satisfies both property from the definition of , and property (2) because two different polynomials of degree can be equal in at most different values.
We now adapt the definition of error correcting codes to graphs.
Definition 2
For a graph , we call the square matrix of dimension with elements in defined as follows.
- •
For each , the -th row of is (recall that is the -th row of the adjacency matrix of ).
- •
For each , the -th row of is the vector , where is the zero-vector of , and is the -th element of .
We can represent as a pair , where belongs to . Similarly, for a graph , we can represent as the matrix:
[TABLE]
where is the matrix with rows , . Note that is symmetric.
Remark 1
Note that , for every two different -node graphs and . Indeed, if , there exists such that is different than . Then, by definition of , . This means that , because and are symmetric matrices.
4.2 Optimal Reconstruction of Arbitrary Graph Classes in Two Rounds
Lemma 2
Let be a set of graphs, the error correcting code with parameters , and let be the smallest prime number greater than . Then, there exists depending only on , satisfying for all different .
**Proof ** From the remark at the end of the last subsection, we know that , for every two different -node graphs and . Then, if we pick uniformly at random we have from Lemma 1:
[TABLE]
Then, by the union bound
[TABLE]
The last inequality follows from the choice of . Therefore, there must exist a such that , for all different .
Theorem 2
Let be a set of graphs. The following holds:
There exists a two-round deterministic algorithm in the congested clique model that solves -Weak-Rec with bandwidth .
- 2)
There exists a three-round deterministic algorithm in the congested clique model that solves -Strong-Rec with bandwidth .
- 3)
There exists a two-round private-coin algorithm in the congested clique model that solves -Strong-Rec with bandwidth whp.
**Proof ** The first algorithm we are going to explain here, Algorithm 2, is deterministic and solves -Weak-Rec with bandwidth . The algorithms for (2) and (3) are slight modifications of Algorithm 2 and will also be explained in this proof.
-
Let be the first prime greater than (then ), and let be the smallest prime number greater than . In the algorithm, node first computes , where is the error correcting code with parameters . Then, for each node communicates to node . This communication round requires bandwidth . After the first communication round, node knows and , i.e., it knows rows and of matrix . Each node computes a vector such that , for all different (each node computes the same ). The existence of is given by Lemma 2. Then, node communicates (broadcasts) and . This communication round requires bandwidth . After the second communication round, each node knows . Then, they locally compute the unique such that . Since belongs to , then necessarily .
-
Suppose now that we are solving -Strong-Rec. In this case does not necesarily belong to . After receiving the fingerprints of , nodes look for a graph in that satisfies (line 9 in Algorithm 2). If such a graph exists, we call it a candidate. Otherwise, every node decides that is not in , so they reject. Note that, if the candidate exists, then it is unique, since for all different , in . So, if the candidate exists, each node checks whether the neighborhood of vertex on and are equal, and announces the answer in the third round (communicating one bit). If every node announces affirmatively, then they output . Otherwise, it means that is not in , so every node rejects.
-
We now show that, if we allow the algorithm to be randomized, then we can spare the third round. In fact, nodes only need to run Algorithm 3 after the first round of Algorithm 2. Let us explain this now. Let be a prime number. In the second round, node picks , and it communicates, together with and , also . After the second round of communication, if a candidate exists, each node computes , . If , then nodes deduce that . Otherwise, they deduce that and rejects. Note that if belongs to , then the algorithm always give the correct answer. Otherwise, it rejects whp. Indeed, if , then , and from Lemma 1, .
Note that our private-coin algorithm for -Strong-Rec has one-sided error. In fact, if the input graph belongs to , then our algorithm reconstructs it with probability . On the other hand, if is not contained in , then our algorithm fails to discard the candidate with probability at most .
5 Revisiting the One Round Case
In this section we revisit the one-round case (and therefore the broadcast congested clique model). But instead of studying hereditary graph classes we study arbitrary graph classes, and we show that for this general case we need a larger bandwith. Our results are tight, not only in terms of the bandwidth, but also in the necessity of using randomization.
Theorem 3
Let be a set of graphs. The following holds:
There exists a one-round deterministic algorithm in the congested clique model that solves -Weak-Rec with bandwidth .
- 2)
There exists a two-round deterministic algorithm in the broadcast congested clique model that solves -Strong-Rec with cost .
- 3)
There exists a one-round private-coin algorithm in the congested clique model that solves -Strong-Rec with bandwidth whp.
**Proof ** The algorithm in this case is very similar to the one we provided in the proof of Theorem 2. Let be a parameter whose value will be chosen at the end of the proof, and let be the error-correcting-code with parameters . Let be the smallest prime number greater than . Let be the vector given by Lemma 2, corresponding to .
In the algorithm, every node computes , and communicates together with , where is the smallest prime greater than . Note that the communication round requires bandwidth
[TABLE]
After the communication round, every node knows , for all , and also knows the matrix . Therefore, every node can compute , for all , and, moreover, compute .
From the construction of , there is at most one graph such that . Therefore, if belongs to , every node can reconstruct it. On the other hand, if we are solving -Strong-Rec, then we proceed as in the algorithm of Theorem 2, either testing whether in one more round, or sending a fingerprint of to check with high probability if a candidate such that is indeed equal to . This verification requires to send more bits, which fits in the asymptotic bound of the bandwidth.
The optimal value of , that is, the one which minimizes the bandwidth, is such that . Threfore, the bandwidth is .
5.1 Tightness of our Algorithms
In this subsection we show that our algorithms for solving -Weak-Rec and -Strong-Rec are tight, from two different perspectives. First, from the point of view of the bandwidth, we show that there are classes of graphs satisfying such that every algorithm (deterministic or randomized) solving -Weak-Rec in the broadcast congested clique model has cost . This lower bound matches the upper one-round bound given in Theorem 3 (up to logarithmic factors).
Then, we show that, when restricted to one-round algorithms, the use of randomization is necessary in order to have non-trivial general algorithms solving -Strong-Rec. Indeed, we prove that there exists a set of graphs satisfying such that, every one-round deterministic algorithm that solves -Strong-Rec, requires bandwidth .
Theorem 4
There exists a class of graphs satisfying such that, any -error public-coin algorithm in the broadcast congested clique model that solves -Weak-Rec, has cost .
**Proof ** Let be the class of graphs defined as follows: belongs to if and only if is the disjoint union of a graph of nodes and isolated nodes. Note that . Indeed, there are labeled graphs of size , and at most different labelings of a graph of nodes using labels (so is closed under isomorphisms).
Let be an -error public-coin algorithm solving -Weak-Rec in rounds and bandwidth , on input graphs of size .
Consider now the following algorithm that solves -Weak-Rec, where is the set of all graphs: on input graph of size , each node supposes that it is contained in a graph formed by plus isolated vertices with identifiers . Note that belongs to . Then, node simulates as follows: at each round, node produces the message of node in according to . Note that the messages produced by nodes labeled do not depend on , so they can be produced by any node of . Since solves -Weak-Rec, at the end of the algorithm every node knows all the edges of , so they reconstruct ignoring vertices labeled .
We deduce that algorithm solves -Weak-Rec. Note that the cost of is on input graphs of size . We deduce that , i.e., the cost of is .
We say that an algorithm recognizes if the algorithm decides whether an input graph belongs to . We call -Recognition the problem of recognizing .
Theorem 5
There exists a set of graphs satisfying such that, and any one-round deterministic algorithm in the congested clique model that solves -Recognition, requires bandwidth .
**Proof ** We prove this theorem by a counting argument. Our goal is to show that there are more small sets of graphs than one-round deterministic algorithms capable to recognize them.
We first count the number of sets of graphs (not necessarily closed under taking isomorphism) containing different graphs of size . We call the family of these sets . There are possible graphs of size , so possible choices for graphs in . We deduce that there exists such that .
On the other hand, we count the number of one-round deterministic algorithms that recognize a set of graphs in with bandwidth at most . A one-round deterministic algorithm is composed of two parts: the algorithm before the communication round, and the algorithm after the communication. The first part of an algorithm is defined by the messages that a node sends on each input. The input of a node is its neighborhood represented by a Boolean vector of size , and an integer representing its label. Therefore, the first part of an algorithm is defined by the messages corresponding to all the possible inputs. Since the bandwidth is , we obtain that there are possible choices for the first part of an algorithm.
The second part of an algorithm is defined by a function , such that if are the messages sent by the nodes in the communication round, then if and only if was produced from an input graph belonging to . The crucial observation is that this implies that can output in at most inputs. Therefore, the number of possible second parts of an algorithm is , where is a constant.
We deduce that the number of one-round deterministic algorithms with bandwidth that are capable to recognize a set of graphs in is at most , with . Since we are considering only deterministic algorithms, two different sets must be recognized by two different algorithms. This implies that must be greater than , so .
Finally, we construct by picking, for each , one set of graphs contained in that can not be recognized by any algorithm of bandwidth .
Remark 2
Note that for any set of graphs , problem -Strong-Rec is at least as hard as -Recognition. We conclude that there exists a set of graphs satisfying such that, any one-round deterministic algorithm that solves -Strong-Rec, requires bandwidth . Note that, since in this case , from Theorem 3 we know that -Strong-Rec can be solved using a one-round private-coin algorithm with bandwidth whp.
6 Discussion
In this paper we showed that all graph classes can be optimally reconstructed in two rounds in the congested clique model. But our algorithm is randomized, it uses private-coins. A natural question is the following: is it possible to achieve the same deterministically? In other words, given an arbitrary graph class , is it always possible to solve -Strong-Rec with a two-round deterministic algorithm with bandwidth ? (Note that this is true for the weak version of the reconstruction problem -Weak-Rec).
We also restricted the reconstruction problem to one-round algorithms. We showed that, if is an hereditary graph class such as forests, planar graphs, interval graphs, unit disc graphs, chordal bipartite graphs, bounded tree-widh graphs, -degenerate graphs, etc., then -Strong-Rec can be solved, whp, with a one-round private-coin algorithm that uses bandwidth . Can we extend this result to every hereditary class of graphs?
A related problem is the recognition problem, where we simply want to decide whether the input graph belongs to the class . It seems that sometimes we can not solve the recognition problem without solving the reconstruction problem. This seems to be true in the case of trees and, more generally, in the case of -degenerate graphs. But this is not always the case. Sometimes, solving the recognition problem requires a much smaller bandwidth. For example, consider the class of split graphs. A split graph is a graph where the vertices can be partitioned into a clique and an independent set (these two sets are connected arbitrarily). The class of split graphs contains graphs of size , so it cannot be reconstructed with cost . However, split graphs can be characterized solely by their degree sequences (see [5]), so they can be recognized by a one-round deterministic algorithm, where each node sends its degree ( bits). It is an interesting challlenge to understand the cases where we can solve the recognition problem without solving the reconstruction problem.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Yossi Arjevani and Ohad Shamir. Communication complexity of distributed convex learning and optimization. In Adv. Neural Inf. Process. Syst. , pages 1756–1764, 2015.
- 2[2] József Balogh, Béla Bollobás, and David Weinreich. The penultimate rate of growth for graph properties. European Journal of Combinatorics , 22(3):277 – 289, 2001.
- 3[3] Florent Becker, Martín Matamala, Nicolas Nisse, Ivan Rapaport, Karol Suchan, and Ioan Todinca. Adding a referee to an interconnection network: What can(not) be computed in one round. In 25th IEEE International Parallel and Distributed Processing Symposium (IPDPS) , pages 508–514, 2011.
- 4[4] Andrew Berns, James Hegeman, and Sriram Pemmaraju. Super-fast distributed algorithms for metric facility location. Automata, Languages, and Programming , pages 428–439, 2012.
- 5[5] Andreas Brandstädt, Van Bang Le, and Jeremy P Spinrad. Graph classes: a survey . SIAM, 1999.
- 6[6] Keren Censor-Hillel, Petteri Kaski, Janne H. Korhonen, Christoph Lenzen, Ami Paz, and Jukka Suomela. Algebraic methods in the congested clique. In ACM Symposium on Principles of Distributed Computing (PODC) , pages 143–152, 2015.
- 7[7] Danny Dolev, Christoph Lenzen, and Shir Peled. “Tri, tri again”: Finding triangles and small subgraphs in a distributed setting - (ext. abstract). In 26th International Symposium on Distributed Computing (DISC) , pages 195–209, 2012.
- 8[8] Andrew Drucker, Fabian Kuhn, and Rotem Oshman. On the power of the congested clique model. In ACM Symposium on Principles of Distributed Computing (PODC) , pages 367–376, 2014.
