Thresholds in random motif graphs
Michael Anastos, Peleg Michaeli, Samantha Petti

TL;DR
This paper generalizes the Erdős-Rényi model to include fixed motifs, establishing thresholds for key properties like connectivity and Hamiltonicity, and providing hitting time results in the random motif graph process.
Contribution
It introduces the binomial random motif graph model and determines thresholds for several fundamental graph properties, extending classical results to motif-based random graphs.
Findings
Thresholds for connectivity, Hamiltonicity, and perfect matching are established.
Hitting time results for the emergence of these properties are proved.
The model generalizes Erdős-Rényi graphs by incorporating fixed motifs.
Abstract
We introduce a natural generalization of the Erd\H{o}s-R\'enyi random graph model in which random instances of a fixed motif are added independently. The binomial random motif graph is the random (multi)graph obtained by adding an instance of a fixed graph on each of the copies of in the complete graph on vertices, independently with probability . We establish that every monotone property has a threshold in this model, and determine the thresholds for connectivity, Hamiltonicity, the existence of a perfect matching, and subgraph appearance. Moreover, in the first three cases we give the analogous hitting time results; with high probability, the first graph in the random motif graph process that has minimum degree one (or two) is connected and contains a perfect matching (or Hamiltonian respectively).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Thresholds in Random Motif Graphs
Michael Anastos
Michael Anastos
Carnegie Mellon University
[email protected] http://www.math.cmu.edu/ manastos ,
Peleg Michaeli
Peleg Michaeli
School of Mathematical Sciences, Raymond and Beverly Sackler Faculty of Exact Sciences, Tel Aviv University, Tel Aviv 6997801, Israel.
[email protected] http://www.math.tau.ac.il/ pelegm and
Samantha Petti
Samantha Petti
School of Mathematics, Georgia Institute of Technology, Atlanta, GA, USA.
[email protected] https://math.gatech.edu/people/samantha-petti
Abstract.
We introduce a natural generalization of the Erdős-Rényi random graph model in which random instances of a fixed motif are added independently. The binomial random motif graph is the random (multi)graph obtained by adding an instance of a fixed graph on each of the copies of in the complete graph on vertices, independently with probability . We establish that every monotone property has a threshold in this model, and determine the thresholds for connectivity, Hamiltonicity, the existence of a perfect matching, and subgraph appearance. Moreover, in the first three cases we give the analogous hitting time results; with high probability, the first graph in the random motif graph process that has minimum degree one (or two) is connected and contains a perfect matching (or Hamiltonian respectively).
S. Petti: This material is based upon work supported by the National Science Foundation Graduate Research Fellowship under Grant No. DGE-1650044
1. Introduction
In the late 1950’s Gilbert [Gil59] and Erdős and Rényi [ER59] introduced two of the most fundamental models for generating random graphs: the binomial random graph , generated by independently adding an edge between each pair of vertices in the complete graph on vertices with probability , and the the uniform random graph , which is a uniformly chosen graph from all graphs on vertices with edges. Since, the extensive study of these simple constructions has influenced a variety of fields including combinatorics, computer science, and statistical physics (see [FK, Bol, JLR] for surveys).
Detailed analysis of the model has led to the development of plethora of new techniques in probability for analyzing random processes, and the model has been used to verify the existence of structures with certain properties [AS]. In computer science, the model has been used to analyze the performance of algorithms on an “average” case, showing that NP complete problems may be easier random instances.
The rise of data in the form of graphs (e.g. internet connections, biological networks, social networks) has further fueled the study of random graphs. In practice, the comparison of real world networks to the Erdős-Rényi model is a popular technique for highlighting the non-random aspects of a network’s structure [Yeg04, Alo07, Son05, Mil02]. Moreover, the model has inspired many other models which are designed to mirror some characteristic of real-world networks (e.g. Watts-Strogatz graphs have small diameter [WT98], Barabási-Albert preferential attachment graph exhibit a power law degree distribution [BA99]).
In this paper we consider a natural generalization of the Erdős-Rényi model in which random motifs are added rather than random edges. A motif is a fixed small subgraph, such as a triangle. The motifs that are overrepresented in a network are correlated to the function of the network [Yeg04, Alo07, Son05, Mil02]. Analyzing random graphs formed as the union of many instances of a particular motif will give insight into the structural properties of networks with many copies of the motif .
We define the binomial random motif graph as the random (multi)graph obtained by adding an instance of on each of the copies of in the complete graph on vertices , independently with probability . Here by we denote the number of automorphisms of . Note that if is an edge, then this is exactly . Similarly, the uniform random motif graph is the random (multi)graph obtained by taking the union of uniformly chosen copies of in without replacement.
Closely related to is the random motif graph process . is the empty graph on vertices and for the graph is generated by adding to a copy of , , chosen uniformly at random from all the copies of except those in i.e. those that have been added to so far. Clearly has the same law as . In addition, by setting to be an edge we retrieve the random graph process introduce by Erdős and Rényi [ER60]. By considering the random motif graph process in place of the uniform random motif graph model we can phrase results in a finer way (see for example Theorem 3).
In this work we show that every monotone graph property has a threshold in the binomial random motif graph . Then we determine the thresholds for connectivity, existence of a perfect matching, Hamiltoncity and subgraph appearance. In the first three cases we also show a hitting time result, according to which w.h.p.111That is, with probability tending to as tends to infinity. the first graph in the random motif graph process that has minimum degree one (or two) is connected (or Hamiltonian respectively).
1.1. Notation
Throughout we assume the motif has no isolated vertices. For an integer , denote by the number of its copies in which intersect the set . For an integer we define the quantities and by
[TABLE]
where is any function of satisfying . Note that the expected number of added instances of in is , which only depends on and on .
1.2. Results
A function is a threshold for a monotone increasing property in the random graph if
[TABLE]
as . Our first result is a generalization of a theorem by Bollobás and Thomason [BT97].
Theorem 1**.**
Every non-trivial monotone graph property has a threshold.
Given Theorem 1, a natural goal is to find the thresholds for various monotone properties. The remaining results of this paper are dedicated towards this goal; we determine the threshold for connectivity, the existence of a perfect matchings, Hamiltonicity, and subgraph appearance.
A first such result, which generalizes a result in [ER59], shows, in particular, that the expected number of motifs needed to make the random motif graph connected depends only on the number of (non-isolated) vertices of the motif.
Theorem 2**.**
Let be a fixed graph. Then
[TABLE]
In fact, we show a hitting time result, according to which the hitting time of connectivity equals, w.h.p., the hitting time of minimum degree one. In other words, the random motif graph process becomes connected exactly when the last isolated vertex disappears, with high probability.
Fix an integer and a graph . Let , and for denote .
Theorem 3**.**
Let be a fixed graph. Then w.h.p. .
We remark that if the motif is connected, every connectivity related question depends solely on the sets of vertices on which copies of are added, and not on the way they are put there. Thus, we may model the question as a (binomial or uniform) random -uniform hypergraph, where . In this case, Theorems 2 and 3 follow immediately from known results about (loose) connectivity in random hypergraphs (see, e.g.,[Poo15]).
In the following two theorems we show that the existence of a perfect matching is also dependent on the number of non-isolated vertices of the motif.
Theorem 4**.**
Let be a fixed graph, and assume that is even. Then,
[TABLE]
Let . The analogue hitting time result is also true.
Theorem 5**.**
Let be a fixed graph, and assume that is even. Then w.h.p. .
Theorem 6 establishes that the thresholds for minimum degree and for Hamiltonicity are the same. Theorem 7 shows the hitting time version of that result.
Theorem 6**.**
Let be a fixed graph. Then
[TABLE]
Let .
Theorem 7**.**
Let be a fixed graph. Then w.h.p. .
Next, we describe the threshold for the appearance of a subgraph . If appears in a random motif graph, then is a subgraph of some configuration of copies of whose union contains vertices. For such an covering of , we call a subset of the covering containing copies of whose union contains vertices an subset. The threshold for the appearance of depends on , the maximum over all covering configurations of the minimum ratio for all subsets of the covering configuration. Definition 15 formally describes .
Theorem 8**.**
Let be a fixed graph, let be a fixed graph, and set and . Then
[TABLE]
The number of excess edges of a connected graph , or simply its excess, is defined to be . In particular, trees have excess [math]. We say that is unicyclic if its excess is , or complex if its excess is at least . The following theorem gives a simple description of when the motif is a path, which allows us to deduce how the copies of fit together to form a copy of at the threshold when first appears. If is a tree, a minimal set of edge disjoint copies of typically forms . If is complex, each copy of the path typically contributes a single edge to . If it is unicyclic, it may be formed by any edge disjoint configuration of paths .
Theorem 9**.**
Let be a path of length and let be a connected graph. Let be the minimum number of edge-disjoint copies of whose union contains as a subgraph. Let . Then
[TABLE]
In the case where the motif is a long path, this result establishes a connection between the threshold for the appearance of subgraphs in random motif graphs and the threshold for the appearance of subgraphs in the trace of a random walk on the complete graph (studied in [KM17]). Let be a connected graph and be the minimum number of paths in any edge-disjoint decomposition of into paths. If is longer than the maximum length path in such a minimum edge-disjoint path decomposition, then the threshold implied by Theorem 9 matches the threshold for the appearance of in the trace of a random walk on the complete graph [KM17].
This should not come as a surprise; by noticing that when the motif is a long path, the random motif graph model approximates the trace model, in the following sense. One may sequentially “cut” the (lazy) simple random walk into chunks with buffers of length . We delete loops created by the trace of each chunk, and we enforce the condition that the remaining edges span a path of length (which is fixed but large). Hence the trace of each such chunk is an independent copy of a path of length . Thus we may couple the trace model and the random motif model such that the trace model will include the random motif model plus some loops plus a small number of buffer edges (which gets smaller as gets larger).
Viewing this analogy this way, we may use Theorems 8 and 9 to reprove the main theorems of [KM17] for the case where the base graph is complete.
2. Existence of thresholds for monotone properties
Proof of Theorem 1.
Assume that is a monotone increasing property and let be the copies of that are spanned by . Observe that
[TABLE]
is a polynomial in . In addition, since is increasing, it is increasing. Therefore we may define by
[TABLE]
We will show that is a threshold for . For two random graphs we write if can be coupled such that is a subgraph of .
First let where as and let . Let be distributed as a for . Then, by considering the probability of no appearance of a fixed copy of , we have that the graph is distributed as . Thereafter implies,
[TABLE]
Hence,
[TABLE]
Now assume that for some as and let . Similarly to before, if we let to be distributed as a for then, we have that
[TABLE]
Hence,
[TABLE]
Rearranging the above gives,
[TABLE]
3. Connectivity
Proof of Theorem 2.
If then by Theorem 19 the minimum degree of is w.h.p. [math], hence it is not connected.
Suppose . In fact, for the argument below, we only assume that (and the conclusion will follow by monotonicity). Let denote the number of vertices of . For denote by the number of connected components of size in . Note that for , if a set of cardinality is a connected component, then there exist copies of inside the set which appear in , and there are no edges between it and its complement, so none of the copies of that intersect that set appear. By Lemma 17,
[TABLE]
Let and suppose . By Lemma 18 and by the union bound there exist constants depending only on such that
[TABLE]
It follows that
[TABLE]
but according to Theorem 19 (for ), there are no isolated vertices w.h.p., and the result follows. ∎
Note that a consequence of this proof is that for , with high probability, every connected component is of cardinality or at least . This means that w.h.p. there exists a unique “giant” component of linear size, and the rest of the vertices are isolated. The next lemma, whose proof uses a simple second moment argument, estimates the number of these isolated vertices for .
Lemma 10**.**
The number of isolated vertices in is w.h.p. at most .
Proof.
Let be the number of isolated vertices in . First,
[TABLE]
Moreover,
[TABLE]
Denote . Thus
[TABLE]
and since , we have that
[TABLE]
Thus, noting that ,
[TABLE]
Proof of Theorem 3.
Denote and . By asymptotic equivalence of the binomial and the uniform models (see, e.g., [JLR]*Section 1.4) we have that w.h.p. has a unique giant component, and the rest of the connected components are isolated vertices, whose number is at most . Denote the set of these isolated vertices by . Together with Theorem 2 we also conclude that w.h.p.
[TABLE]
We may thus couple , , and such that
[TABLE]
by starting with and adding random copies of to create . Note that if none of these edges is fully contained in (and the coupling succeeds) then . Thus, there exist positive constants such that,
[TABLE]
4. Hamiltoncity and Perfect Matchings
The proof of Theorems 7 and 5 can be given in parallel, using the same techniques and tools. For clarity though, in this section we focus mainly on proving Theorem 7 and we give a sketch of the proof of Theorem 5 in the appendix.
For proving our Hamiltonicity result we use the standard technique of Posa’s rotations. We define Small to be the vertices of significantly smaller degree than the expected one and we set Large to be the rest of the vertices. We first show that small to medium subsets of Large expand and that the vertices in Small are well spread. This is done in the context of Lemmas 11 and 12, 13 respectively. We use these properties of Small and Large in order to prove all the the ingredients needed to apply the Posa’s rotations, which we gather in Lemma 14.
Let and recall that , . W.h.p. (see [FK]) we can couple and such that
- (i) and
- (ii) there are copies of in , hence in , that are not present in .
Observe that the above coupling and Theorem 7 imply Theorem 6. In addition a similar coupling and Theorem 5 imply Theorem 4.
We now define the sets Small, Large based on the degrees of the vertices in . Let \textsc{Large}=\{v\in V:v\text{ intersects at least}\ln\ln n\text{ copies of H in }G(H,n,p_{0})\} and .
Lemma 11**.**
W.h.p. every of size at most satisfies .
Lemma 12**.**
W.h.p. for every pair there do not exist copies of in that span a connected subgraph containing both . Hence w.h.p. every pair is at distance at least 7 in .
Lemma 13**.**
W.h.p. for every there exists at most one copy of in , hence in , that intersect both and .
Now we generate as follows. We first generate . Then we randomly permute the copies of not appearing in , let them be . We also let . We define the sequences and in the following way. At step we query whether it is incident to a vertex in Small. If it is then we set and . Otherwise we set and . Let and .
Given the sequence and the set we define the graph sequence by and for . Observe that consists of all copies of in that have not been added to , equivalently the copies of that are not incident to Small. Thus F_{w}=G^{\prime}_{t^{*}}\cup\big{(}\bigcup_{j=1}^{w}H_{i_{j}}\big{)}=G^{\prime}_{0}\cup\big{(}\bigcup_{i=1}^{t^{*}}H_{i}\big{)}=\bar{G}(H,n,\tau_{2}).
Lemma 14**.**
W.h.p. the following hold:
- i)
, 2. ii)
every of size at most satisfies in , 3. iii)
* is connected,* 4. iv)
for every , , and every set consisting of edges not present in there exist a constant such that the probability that intersects is at least .
We are now ready to apply Posa’s rotations . For that assume that is not Hamiltonian and consider a longest path in , , . Let be the end-vertices of . Given where is an interior vertex of we can obtain a new longest path where is the neighbor of on between and . In such a case we say that is obtained from by a rotation with the end-vertex being the fixed end-vertex.
Let be the set of end-vertices of longest paths of that can be obtained from by a sequence of rotations that keep as the fixed end-vertex. Thereafter for let be a path that has end-vertices and can be obtain form by a sequence of rotations that keep as the fixed end-vertex. Observe that for and there exists a - path of length that can be obtained from via a sequence of Posa rotations. Thus we can conclude that does not belong to . Indeed assume that . Then we can close into a cycle that is not Hamiltonian. Since is connected there is an edge spanned by . spans a path of length contradicting the maximality of . Similarly if then is either Hamiltonian or it contains a path that is longer than . At the same time it follows (see [FK]*Corollary 6.7) that
[TABLE]
Moreover for every
[TABLE]
As a consequence of Lemma 11, we have that and for every . Let . Then .
Now let be the indicator of the event and set . From Lemma 14 iv) we have (here ). In the event that is not Hamiltonian, while is a random variable for . Since we have . Hence w.h.p. is Hamiltonian and the hitting time for Hamiltonicity equals the hitting time for minimum degree 2.
5. Subgraph appearance
In there is only one way for a specified subgraph to appear on a fixed set of vertices: all the edges in the subgraph must be present. In the case of random motif graphs, there are multiple ways to place motifs so that a specified subgraph appears on a fixed set of vertices. For example, in a random two-path graph, a triangle may appear on if (i) the paths and are present or (ii) the paths , and are present. In order to pin down the threshold for subgraph appearance, it is necessary to understand the various motif configurations that cause the subgraph to appear and their relative probabilities. The following definition provides the notation to describe such configurations.
Definition 15**.**
Let be a set of vertices. Let be a fixed graph on a subset of the vertices of . Let be copies also defined on subsets of vertices of .
- (a)
We say is an covering of if (i) , (ii) , and (iii) for each , . 2. (b)
Let be the number of unique configurations of coverings, i.e. the number of ways to place copies of on vertices such that conditions (i)-(iii) of (a) hold. Enumerate the possible configurations of coverings with values in . For , an covering of is an covering with configuration . 3. (c)
We say the set (with precisely elements) is an subset of an covering if (i) , and (ii) . 4. (d)
Let \mathcal{I}(S,H)=\{(a,b,i)\>|\>\text{there exists an (a,b)SHi\in[k(a,b)]}\}. 5. (e)
For , let
[TABLE] 6. (f)
For , let and denote
[TABLE]
Proof of Theorem 8.
Let . We say that an instance of the subgraph in is an instance if the placed graphs that contribute at least one edge to form an covering of . Let denote the number of instances of in . Let be the total number of instances of the subgraph in .
First we use the first moment method to show that if , then the probability that occurs as a subgraph is . It suffices to show that for all , since
[TABLE]
and is a constant independent of .
We now compute for a fixed triple . Let be an subset of the configuration with . Let be the number of instances of in formed by the configuration . Since an instance of contains an instance of the configuration , . The number of ways to select vertices is at most . The probability that a labeled copy of is placed on a specified set of vertices is . We compute
[TABLE]
where is a constant depending only on the number of automorphisms of and the number of automorphisms of the configuration . It follows that for , , as desired.
Next we use the second moment method to show that if then appears as a subgraph almost surely. It suffices to show that there exists some such that is almost surely positive. Let be such that . We apply Corollary 4.3.5 of [AS] to show that is almost surely positive. Let where is an indicator random variable for the event that there is an instance of formed by a configuration of each present on a specified set of vertices. Fix , and let
[TABLE]
where indicates that and are not independent. By 4.3.5 of [AS], if and , then almost surely.
First we show that . We compute as above
[TABLE]
where is a constant depending only on the number of automorphisms of and the number of automorphisms of the configuration . It follows that if then .
Finally, we show . Observe that under the assumption ,
[TABLE]
6. Conclusion
6.1. The value of the random motif model
The study of random motif graphs has the potential to strengthen the impact of the Erdős-Rényi construction. In the context of analyzing real-world networks with an overrepresented motif, random motif graphs may be a more insightful null hypothesis model to compare against to identify non-random structure. For instance by studying subgraphs counts of random motif graphs one can determine if some larger motif pattern is a byproduct of having many copies of or is itself some novel aspect of the network structure. Moreover, it is possible that a random motif graph may be used to establish the existence of a graph with some extremal property of interest. Finally, random motif graphs can be used as an alternate definition of average case for analyzing algorithms under the assumption that the input has some motif structure.
6.2. Future directions: understanding threshold behavior more broadly
We have established that random motif graphs behave similarly to traditional Erdős-Rényi random graphs with respect to thresholds and hitting times for monotone properties. Does similar behavior appear when we consider random graphs formed by randomly adding primitive subgraphs whose size scales with , the number of vertices of the random graph? Instead of taking to be a fixed motif, could be a path, cycle, matching or clique whose size depends on , for example. Some of these cases were in fact studied in several contexts. For example, the union of random perfect matchings is contiguous to the random -regular graph, and is sometimes easier to analyze [Wor99]. Moreover, we can consider the class of models where itself is chosen from some probability distribution. In several cases, this has been studied as well. For instance, [FKT99] and [FGRV14] consider the case when is the uniform spanning tree, and [Pet18] considers the case when is an Erdős-Rényi random graph with constant density and size dependent on . Further study of these models is a first step toward delineating a larger family of random graphs that exhibit Erdős-Rényi like threshold and hitting time behaviors.
References
Appendix A Estimates for useful functions
Lemma 16**.**
For , if and then .
Proof.
Observe that for ,
[TABLE]
thus
[TABLE]
For , denote by the number of copies of that intersect but that are not contained in .
Lemma 17**.**
For , if and then .
Proof.
Observe that for ,
[TABLE]
thus
[TABLE]
For convenience we define for and ,
[TABLE]
Lemma 18**.**
For we have that .
Proof.
Write . Observe that it is strictly increasing in . Note also that
[TABLE]
It follows that
[TABLE]
so . ∎
Appendix B Minimum degree
Theorem 19**.**
With high probability
[TABLE]
Proof.
Let . It suffices to show that with high probability for
[TABLE]
and
[TABLE]
Proof of (1): Let . For let and .
[TABLE]
In addition,
[TABLE]
Chebyshev’s inequality give us,
[TABLE]
Hence with high probability there exist vertices of degree
Proof of (2): Let . Let be the event that in there exists a vertex of degree that lies on more than copies of . In the event there exists a vertex and a vertex set of size such that all the neighbors of lie in and at least copies of intersect , each in at least vertices. Therefore,
[TABLE]
In the event the number of vertices of degree less than is bounded by the number of vertices that are covered by at most copies of . Thus
[TABLE]
Appendix C Proofs of lemmas for Hamiltoncity
Proof of Lemma 11.
If there exists of size such that then there exist sets of size and respectively such that no copy of , satisfies and (take and to be any subset of of size ). The probability of such event occurring is bounded above by
[TABLE]
Now assume that there exists a set of size at most that satisfies . Since every vertex in is in at least copies of and every copy of covers vertices we have that intersects at least copies of . Each of those copies is spanned by . Therefore there exists a set of size that intersects at least copies of each, in at least 2 vertices. Since every vertex in Large has neighbors . The probability that such a set exists is bounded by
[TABLE]
Proof of Lemma 12.
For and let be the event that in intersects at most copies of that do not intersect . For ,
[TABLE]
Let be the event that for some there exist copies of in that span a connected subgraph containing both . If occurs then we can find a set such that i) the events , occur and ii) there exist in such that . Since all the aforementioned events are independent
[TABLE]
Proof of Lemma 13.
Lemma 12 implies that w.h.p. there do not exist and , such that in and are in a copy of and and are in a copy of . The probability that there exist , that are both contained in more than one copy of in is bounded by
[TABLE]
Proof of Lemma 14.
- (1)
Recall that we can couple such that w.h.p. and there are at least copies of in that are not present in . From Lemma 13 it follows that w.h.p. each of those copies that spans a vertex in Small also spans a unique vertex in . Hence . 2. (2)
Let , and set , . Lemma 11 implies that . In the case we have
[TABLE]
Next assume . Lemma 12 implies that no two vertices in Small are within distance 2 in , hence their neighborhoods are disjoint. Also has minimum degree 2. Therefore Now let where consists of all the vertices in that are within distance from and . If then since and have disjoint neighborhoods we have that
[TABLE]
Otherwise and . For let be the set of vertices in that are within distance from , hence . Lemma 12 states that no two vertices in Small are within distance 6, thus for the sets are disjoint. In addition since and , Lemma 11 implies that for all . Thus
[TABLE] 3. (3)
Assume that there exists a set such that is a connected component of and let . has minimum degree 2 therefore . Let and . Lemma 13 implies that every vertex in can be adjacent to at most 1 vertex in Small hence . Thereafter Lemma 11 implies that since otherwise
[TABLE]
Finally the probability that there exists a connected component of size in is bounded by
[TABLE]
iv) First we show that w.h.p. . Indeed by Markov’s inequality,
[TABLE]
Now let be a set of edges not present in and be the subset of consisting of the edges that are not incident to Small. Then w.h.p. . Every edge in belongs to copies of that are no present in and every copy of may cover at most edges in . Therefore there exists a set consisting of at least distinct copies of that intersect . is uniformly distributed among the copies of that are not present in and are not incident to a vertex in Small. Thus
[TABLE]
Appendix D Proof sketch of Theorems 4 and 5
To prove Theorem 5 we first indicate the edge set , consisting of the edges that are incident to vertices of degree 1. Then we delete these edges and the vertex set consisting of the vertices incident to them. Thereafter we use exactly the same techniques as above in order to find a Hamilton cycle in the remaining graph. We use half of the edges of that cycle and the edges in to form a perfect matching.
Given the above, the only substantial difference is that while generating (in place of ) we stop at time . The proofs of all Lemmas with exception the proof of Lemma 14, follow in exactly the same way. For the proof of Lemma 14 we have to be slightly more cautious as we want to prove the corresponding statements for the subgraph that is spanned by . Thus we have to use and in place of Small and Large respectively.
Appendix E Proof of Theorem 9
Before proving Theorem 9, we derive an expression for and establish the following upper bound on .
Lemma 20**.**
Consider an covering of by a path of length and an subcovering with connected components. Let be the subgraph of covered by connected component of the subcovering. Let and . Let be the number of duplicate edges in the subcovering, i.e. is the smallest integer such that removing edges from multigraph union of copies of can yield a simple graph. Then
[TABLE]
and
[TABLE]
Proof.
We compute . Note that each of the copies of contributes vertices, however vertices may be counted multiple times. We compute
[TABLE]
where the first term subtracted corresponds to doubling counting vertices in each connected component and subtracting corresponds to removing double counting for vertices adjacent to edges of that are covered multiple times.
By definition, . For the subcover that is the entire cover, , and if is edge-disjoint and if is not edge-disjoint. Thus, Equation 4 follows directly from Equation 3. ∎
Proof.
(of Theorem 9.) We consider each case separately.
Case: . Consider an covering. If is edge-disjoint, then . It follows from Equation 4 that
[TABLE]
Thus .
Next consider an edge-disjoint cover of by copies of , . By Equation 3, for any subcover of the cover,
[TABLE]
This value is minimized with and , which is achieved by the subcover which is the whole cover. Thus , and so .
Case: . By Equation 4, for all and so it follows that .
Next consider an edge-disjoint cover of , . By Equation 3, for any subcover of the cover,
[TABLE]
This value is minimized with , which is achieved by the subcover which is the whole cover. Thus , and so .
Case: . Consider an cover. By Equation 3,
[TABLE]
Let and be the number of edges and vertices of covered by the subcover, so . It follows
[TABLE]
To give an upper bound on , we construct a subcover of the cover as follows. Let be a subgraph of with vertices and edges such that . Let correspond to the subcover that minimally covers , and let be the subgraph of covered by this subcover (so is a subgraph of ).
We claim that . Note that and . In each component of , at least one vertex is included in . Since the number of vertices in a connected component is at least the number of edges in the connected component minus one, and at least one vertex in each connected component is not included in , it follows that . Thus and the claim follows.
By considering this subcover with parameters , we obtain
[TABLE]
since and . It follows that .
Finally to lower bound we consider a cover in which there are copies of and each copy covers precisely one edge of . In this case in all subcovers . By Equation 5
[TABLE]
Thus . ∎
