Beating Treewidth for Average-Case Subgraph Isomorphism
Gregory Rosenthal

TL;DR
This paper demonstrates that for certain graphs like hypercubes, the average-case complexity of the subgraph isomorphism problem can be significantly lower than the worst-case treewidth-based bounds, by analyzing related graph parameters.
Contribution
It proves that the embedding parameter is bounded by the complexity parameter, shows that the complexity parameter can be asymptotically less than treewidth, and constructs circuits that solve the problem efficiently on average.
Findings
Average-case complexity for hypercubes is sublinear in treewidth.
The embedding parameter is bounded by the complexity parameter.
Constructed circuits match upper and lower bounds for average-case complexity.
Abstract
For any fixed graph , the subgraph isomorphism problem asks whether an -vertex input graph has a subgraph isomorphic to . A well-known algorithm of Alon, Yuster and Zwick (1995) efficiently reduces this to the "colored" version of the problem, denoted -, and then solves - in time where is the treewidth of . Marx (2010) conjectured that - requires time and, assuming the Exponential Time Hypothesis, proved a lower bound of for a certain graph parameter . With respect to the size of circuits solving - in the average case, Li, Razborov and Rossman (2017) proved (unconditional) upper and lower bounds of and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
{NoHyper}
Beating Treewidth for Average-Case Subgraph Isomorphism
Gregory Rosenthal
University of Toronto Email: [email protected]. Supported by NSERC (PGS D).
Abstract
For any fixed graph , the subgraph isomorphism problem asks whether an -vertex input graph has a subgraph isomorphic to . A well-known algorithm of Alon, Yuster and Zwick (1995) efficiently reduces this to the “colored” version of the problem, denoted -, and then solves - in time where is the treewidth of . Marx (2010) conjectured that - requires time and, assuming the Exponential Time Hypothesis, proved a lower bound of for a certain graph parameter . With respect to the size of circuits solving - in the average case, Li, Razborov and Rossman (2017) proved (unconditional) upper and lower bounds of and for a different graph parameter .
Our contributions are as follows. First, we prove that is for all graphs . Next, we show that can be asymptotically less than ; for example, if is a hypercube then is \Theta\mathopen{}\mathclose{{}\left(\mathit{tw}(G)\big{/}\sqrt{\log\mathit{tw}(G)}}\right). This implies that the average-case complexity of - is when is a hypercube. Finally, we construct circuits of size that solve - in the average case, closing the gap between the upper and lower bounds of Li et al.
1 Introduction
The subgraph isomorphism problem asks, given graphs and , whether has a subgraph isomorphic to . In the “colored” or “partitioned” version of the problem, each vertex of the larger graph comes with a “color” from the vertex set of , and we ask whether has a subgraph that is isomorphic to with respect to this coloring. We denote the uncolored and colored subgraph isomorphism problems by \text{G\mathsf{SUB}_{\mathrm{uncol}}}(X) and \text{G\mathsf{SUB}}(X) respectively.
Subgraph isomorphism is NP-complete (e.g. if is a clique or Hamiltonian cycle), so research has focused on algorithms for a variety of special cases in the context of parameterized complexity, surveyed in [MP14]. If is a fixed graph on vertices then - is solvable in time by brute force, where (here and throughout this section) is the order of the input graph. The color-coding algorithm of Alon, Yuster and Zwick [AYZ95] improves on this by efficiently reducing - to - and solving the latter in time , where is the treewidth of the fixed graph .
The exponent can sometimes be improved using fast matrix multiplication [NP85, EG04], but no significantly faster algorithm is known for either the colored or uncolored subgraph isomorphism problem. Marx [Mar10] conjectured the following:
Conjecture 1.1**.**
There is no class of graphs with unbounded treewidth, no algorithm that on inputs and solves \text{G\mathsf{SUB}}(X), and no function such that if is in then runs in time .
Marx [Mar10] came close to proving 1.1 assuming the Exponential Time Hypothesis (ETH) [IPZ01], which is the hypothesis that solving 3SAT on variables requires time. We state his result in terms of a parameter (short for “embedding”) which we will define in Section 4:
Theorem 1.2** ([Mar10]).**
Assuming ETH, there is no class of graphs with unbounded treewidth, no algorithm that on inputs and solves \text{G\mathsf{SUB}}(X), and no function such that if is in then runs in time .
Marx [Mar10] proved that is , so Theorem 1.2 comes within a logarithmic factor in the exponent of proving 1.1 (under ETH). However, our results include a counterexample to an average-case analogue of 1.1, in a sense that will be made precise in Section 3. Moreover, this counterexample holds in , i.e. on unbounded-fanin boolean circuits of depth depending only on .
Li, Razborov and Rossman [LRR17] proved that for fixed , the average-case complexity of - is between and , where is a graph property and is an absolute constant.111In [LRR17], the parameter was called . (See Section 3 for Li et al.’s definition of ; we also prove that can be equivalently defined in terms of the transition matrix of a certain random walk on .) We tighten this gap, answering a question posed in [LRR17]:
Theorem 1.3**.**
There is a constant such that for any fixed graph , the average-case complexity of - is at most .
We observe that a similar result holds easily on Turing machines, using as a subroutine the sort-merge join algorithm from relational algebra. This involves sorting, which cannot be done in (polynomial-size) [Hås86], so our circuit instead uses hashing that relies on concentration of measure for subgraphs of random graphs.
Li et al. [LRR17] also proved that is between and , from which it follows that the worst-case complexity of - is at least in . Li et al. posed the question of whether is ; an affirmative answer would have implied that 1.1 holds in .
However, the following example separates from treewidth. The Hamming graph has vertex set and edges between every two vertices that differ in exactly one coordinate. It is already known that has treewidth \Theta\mathopen{}\mathclose{{}\left(q^{d}\big{/}\sqrt{d}}\right) [CK06]. We prove the following:
Theorem 1.4**.**
\kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)* is .*
Thus, if is the hypercube graph for example, then is \Theta\mathopen{}\mathclose{{}\left(\mathit{tw}(G)\big{/}\sqrt{\log\mathit{tw}(G)}}\right). It follows that an average-case analogue of 1.1 is false if is taken to be the set of all hypercubes. We also prove the following (for arbitrary graphs ):
Theorem 1.5**.**
* is .*
Because of Theorem 1.5, even if our upper bound generalizes to the worst case, it is still consistent with current knowledge (in particular Theorem 1.2) that ETH is true. Another consequence of Theorem 1.5 is that the lower bound from Theorem 1.2 holds unconditionally in .
It follows from Theorems 1.4 and 1.5 that if is a hypercube then , so proving that 1.1 holds under ETH cannot be done by proving that is . In fact, this conclusion was already known: Alon and Marx [AM11] proved that if is a 3-regular expander then is . Li et al. [LRR17] proved that if is a 3-regular expander then is , which makes our separation of from treewidth more surprising. On the other hand, we will see that Theorem 1.5 is tight in the case of Hamming graphs.
We can make a similar statement regarding . Amano [Ama10] observed that the color-coding algorithm for - can be implemented by circuits of size for fixed . Our separation of from treewidth implies that if 1.1 holds in , then this cannot be proved using average-case complexity as defined here and in [LRR17].
The paper is organized as follows. In Section 2 we introduce some notation and definitions. In Section 3 we define the average-case problem and , and give an -time algorithm for the average-case problem. In Section 4 we define and prove that is . In Section 5 we prove that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is , and obtain as a corollary that \mathit{emb}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is as well. We also summarize the proof of Chandran and Kavitha [CK06] that \mathit{tw}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is \Theta\mathopen{}\mathclose{{}\left(q^{d}\big{/}\sqrt{d}}\right). In Section 6 we prove our upper bound.
2 Preliminaries
It will be convenient to define . (This differs from the standard notation when .) We will often fix a graph , in which case the constants hidden in asymptotic notation are allowed to depend on .
We use boldface to denote random variables. The indicator variable equals 1 if the event occurs and 0 otherwise. Expected value is denoted . An event occurs asymptotically almost surely (a.a.s.) if it occurs with probability as goes to infinity.
Let for . If a positive real number is used in a context where a natural number is expected (for example ), it’s because can be rounded arbitrarily to or without affecting the asymptotic behavior of whatever is being considered.
2.1 Graphs
All graphs we consider are simple and undirected, and may have isolated vertices. If is a graph then let and denote its vertex and edge sets, with respective cardinalities and . If and are adjacent vertices then we denote the edge connecting them by or . A graph is a subgraph of , denoted , if and .
Definition 2.1** (Colored subgraph isomorphism problem).**
For graphs and , where comes with a coloring , the problem \text{G\mathsf{SUB}}(X) asks whether has a subgraph such that (restricted to ) is an isomorphism from to .
For let be the induced subgraph of on , and more generally let . Let , and for let .
When the parent graph is clear in context, let be the degree of a vertex , and for disjoint let be the number of edges between and . Similarly, for vertex-disjoint graphs let .
Let be the graph with vertex set and edge set , and define similarly. Note that may have isolated vertices even if and do not. If are graphs then let , and let be the same interval without , etc.
The Cartesian product of graphs and , denoted , has vertex set and edges for all and , and for all and . Let be the Cartesian product of copies of .
We denote by the complete graph on vertices, also called the -clique. It follows that has vertex set , and two vertices are adjacent if and only if they differ in exactly one coordinate. Such graphs are called Hamming graphs. A special case is the -dimensional hypercube ; we will use for its vertex set.
Definition 2.2** (Graph minor).**
A graph is a minor of a graph if there exists a minor mapping assigning a connected component of to each vertex of , such that and are vertex-disjoint for all , and if then there exists an edge in with endpoints in and .
In particular, any subgraph of is also a minor of (e.g. let be the identity).
Definition 2.3** (Treewidth).**
A tree decomposition of a graph is a tree whose vertices are subsets of (called “bags”), such that each vertex and edge of is contained in at least one of the bags, and for all , the induced subgraph of on the bags that contain is a connected subtree of . The width of is one less than the size of the smallest bag, and the treewidth of , denoted , is the minimum width over all tree decompositions.
Roughly speaking, a graph has small treewidth if and only if it’s “similar to a tree”. See e.g. [Bod98, BK08] for further background, and [HW17] for a survey of parameters that are polynomially tied to treewidth.
The edge expansion of a graph is defined as follows:
[TABLE]
A bounded-degree expander is a graph with edge expansion and maximum degree (see [HLW06] for a survey). Let be the ’th largest eigenvalue of the adjacency matrix of . We will use the following half of Cheeger’s Inequality:
Fact 2.4** ([AM85]).**
If is a -regular graph then .
Finally, let \mathbf{ER}\mathopen{}\mathclose{{}\left(n,p}\right) be the Erdős-Rényi graph on vertices in which each possible edge exists independently with probability .
3 The Average-Case Problem and the Parameter
3.1 Threshold Random Graphs
First we will define threshold weightings, which assign weights to the vertices and edges of a graph subject to certain constraints. Then we will define a family of random graphs for each threshold weighting. The content in this subsection is essentially all from [LRR17].
Definition 3.1**.**
A threshold weighting on a graph is a pair with the following property. For let and , and let . Then, for all , and . Let be the set of threshold weightings on .
We will often denote in a slight abuse of notation. (Since if is a single vertex, the pair is uniquely determined by .) The requirement that be nonnegative is redundant because it’s a special case of the requirement that be nonnegative. The requirement that is also redundant because for every edge ,
[TABLE]
It will sometimes be convenient to define for , e.g. for disjoint sets let , and for vertex-disjoint let .
Example 3.2** (Markov Chains).**
Let be a column stochastic matrix (meaning each column sums to 1) such that if then either or . Let for all , and for all . Then for all ,
[TABLE]
with equality if . In fact, we prove that every threshold weighting is equivalent to at least one Markov Chain (Appendix A).
The following threshold weighting will be especially important, and can be thought of as representing a uniform random walk on :
Definition 3.3**.**
If lacks isolated vertices then let be the threshold weighting generated in Example 3.2 when . That is, , where for all and for all . If is -regular then this simplifies to .
Now we define threshold random graphs:
Definition 3.4**.**
For let be the graph with vertices for and , and for , each edge independently with probability . The graph comes with the coloring to defined by .
For and in the support of , let be the set of subgraphs such that the aforementioned coloring (restricted to ) is an isomorphism from to . We say that such a graph is “-colored”. Note that can be identified with a subset of .
Lemma 3.5**.**
If and then .
Proof.
Let . The set contains each of its possible elements with probability , so the result follows from linearity of expectation. (The accounts for having to round to an integer.) ∎
Lemma 3.5 motivates the requirements that be nonnegative everywhere and that . Recall that the problem \text{G\mathsf{SUB}}(X) asks whether is the empty set. Since is required to be zero, it follows that has (approximately) one element on average, and the probability that is empty is known to be bounded away from 0 and 1 as goes to infinity [LRR17].
3.2 The Parameter and an Algorithm for the Average Case
We now define :
Definition 3.6** ([LRR17]).**
Let be a graph with no isolated vertices. Let be the set of union sequences, meaning sequences of distinct subgraphs of such that and each is either an edge or the union of two previous graphs in the sequence. For let . Finally, let .
To simplify the exposition, whenever we refer to , the graph is implicitly assumed to lack isolated vertices. Li et al. [LRR17] proved that for any fixed , circuits solving \text{G\mathsf{SUB}}(\mathbf{X}_{\Delta,n}) a.a.s. require size at least and at most (where is an absolute constant). The results about average-case complexity described in Section 1 are with respect to a such that .
Theorem 3.7**.**
The problem \text{G\mathsf{SUB}}(\mathbf{X}_{\Delta,n}) can be solved in time a.a.s. for any fixed .
Proof.
First we prove a weaker upper bound of , in a manner analogous to the circuit from [LRR17], and then we describe a modification (on Turing machines) that removes the factor of 2 from the exponent. Later we will remove the factor of 2 in using a different approach, summarized at the beginning of Section 6.
Let be a union sequence such that . For any , by Lemma 3.5 and Markov’s Inequality, P\mathopen{}\mathclose{{}\left(|\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(H)|>n^{\Delta(H)}\log n}\right)\leq 1/\log n. (We will obtain a tighter bound of in Section 6.1.) By a union bound it follows that if then a.a.s. Assume this condition holds for . For each successive in , compute as follows. If is a single edge then this is trivial. Otherwise for some previous , in which case is the set of such that and the projections of and onto are equal. Therefore can be computed by brute force in time . Finally, check whether is empty.
We can save a quadratic factor by computing from and as follows. (This is a case of the sort-merge join algorithm for computing the natural join of two relations, as defined in database theory [SKS11].) Define a partial order on by projecting onto and applying the lexicographic order on . Sort and in nondecreasing order, and for convenience add the symbol to the end of both sorted lists. Let and be the first elements of and respectively, and initialize an empty accumulator (which will ultimately equal ). While and , do the following. If then let be the next element of . If then let be the next element of . Otherwise, let , and while and the projections of and onto are equal, add to the accumulator and let be the next element of . Then (once the procedure involving has finished) let be the next element of .
Sorting and takes comparisons, each of which takes time, and then computing takes time. ∎
We will use the following graph-theoretic properties of :
Theorem 3.8** ([LRR17]222Specifically, Corollary 4.2, Theorem 4.9, and Theorem 5.1 of [LRR17] correspond to Items 3.8(i), 3.8(ii) and 3.8(iii) respectively.).**
Let be a graph with no isolated vertices.
- (i)
There exists (meaning for all vertices ) such that . 2. (ii)
, where is the edge expansion of . 3. (iii)
If is a minor of some graph then .
Corollary 3.9**.**
- (i)
If is a bounded-degree expander then is . 2. (ii)
If is a -regular graph then .
Proof of 3.9.
Item 3.9(i) follows from Item 3.8(ii), as observed by Li et al. [LRR17]. Item 3.9(ii) follows from Items 3.8(ii) and 2.4. ∎
4 The Parameter and Proof that is
Recall that is significant because of its role in Marx’s ETH-hardness result for -, namely Theorem 1.2.
Definition 4.1** ().**
Let G^{\mathopen{}\mathclose{{}\left(q}\right)} be the graph formed by replacing each vertex of with a -clique, i.e. it has vertices for all and , and edges for all such that either or . Let be the supremum of all for which there exists such that if is any graph with edges and no isolated vertices, then is a minor of G^{\mathopen{}\mathclose{{}\left(\lceil m/r\rceil}\right)}, and furthermore a minor mapping from to G^{\mathopen{}\mathclose{{}\left(\lceil m/r\rceil}\right)} can be computed in time for some function .
Although the requirement that such a minor mapping be efficiently computable is crucial in Theorem 1.2, none of the other results about that we reference or derive depend on this requirement, so we may safely ignore it going forward. The following example illustrates 4.1:
Example 4.2** ( [Mar10]).**
Since K_{k}^{\mathopen{}\mathclose{{}\left(\lceil m/r\rceil}\right)}=K_{k\lceil m/r\rceil}, any graph with edges is a minor of K_{k}^{\mathopen{}\mathclose{{}\left(\lceil m/r\rceil}\right)} if and only if . If has no isolated vertices then could have up to vertices, so . Therefore : it is sufficient for to be at most (i.e. ), and no satisfies for arbitrarily large .
Remark*.*
The name comes from the fact that Marx [Mar10] called a minor mapping from to G^{\mathopen{}\mathclose{{}\left(q}\right)} an “embedding of depth ” from into . Marx used the notation G^{\mathopen{}\mathclose{{}\left(q}\right)}, but the parameter is new in the current paper, all results about in [Mar10, AM11] having been stated in terms of embeddings of some depth.
The following is used in our proof that is :
Lemma 4.3**.**
\kappa\mathopen{}\mathclose{{}\left(G^{\mathopen{}\mathclose{{}\left(q}\right)}}\right)\leq q\max(\kappa(G),2).
Proof.
Let \Delta=(\alpha,\beta)\in\theta\mathopen{}\mathclose{{}\left(G^{\mathopen{}\mathclose{{}\left(q}\right)}}\right) such that \kappa\mathopen{}\mathclose{{}\left(G^{\mathopen{}\mathclose{{}\left(q}\right)}}\right)=\kappa_{\Delta}\mathopen{}\mathclose{{}\left(G^{\mathopen{}\mathclose{{}\left(q}\right)}}\right). Define a threshold weighting as follows: For all and ,
[TABLE]
This is a threshold weighting because if then \Delta^{\prime}(H)=\Delta(H^{\mathopen{}\mathclose{{}\left(q}\right)})/q\geq 0, with equality if . It’s also normalized to .
Let be an optimal union sequence for with respect to . Construct a union sequence for G^{\mathopen{}\mathclose{{}\left(q}\right)} as follows:
For each append an arbitrary union sequence for e^{\mathopen{}\mathclose{{}\left(q}\right)}. 2. 2.
For each (in order) append H^{\mathopen{}\mathclose{{}\left(q}\right)}.
If H\subseteq e^{\mathopen{}\mathclose{{}\left(q}\right)} then \Delta(H)\leq\alpha(e^{\mathopen{}\mathclose{{}\left(q}\right)})\leq 2q, and we’ve already seen that \Delta(H^{\mathopen{}\mathclose{{}\left(q}\right)})=q\Delta^{\prime}(H) for all . Therefore,
[TABLE]
Now we prove that is (Theorem 1.5), using an argument similar to the proof by Marx [Mar10] that is :
Proof.
Let , and assume there exists an arbitrarily large 3-regular expander that’s a minor of G^{\mathopen{}\mathclose{{}\left(\lceil e(H)/r\rceil}\right)}. Then by Items 3.9(i), 3.8(iii) and 4.3,
[TABLE]
so must be . ∎
Li et al. [LRR17] posed the question of whether Theorem 1.2 holds with in place of . By Theorem 1.5 this would be a stronger bound, which makes the question even more interesting. This problem is open even in the case of 3-regular expanders: recall from Section 1 that if is a 3-regular expander then is and is [AM11, LRR17].
The fact that is gives an alternate proof, besides the one in [LRR17], that is .
5 Separating from Treewidth
In Section 5.1 we prove that , which is a special case of the more general result that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)=\Theta(q^{d}/d), and improves on the observation of Li et al. [LRR17] that is . We obtain tighter multiplicative constants in the case , and it provides an opportunity to illustrate the main ideas of our proof in a simpler setting, but it may be skipped without penalty. In Section 5.2 we prove that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is when is even, which is sufficient to separate from treewidth. Again, this case is cleaner than the general case and conveys most of the intuition behind it. In Appendix B we prove that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is for all . In Section 5.3 we prove that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is in two different ways, completing the proof that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is (Theorem 1.4), and we obtain as a corollary that \mathit{emb}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is as well. In Section 5.4 we summarize the proof of Chandran and Kavitha [CK06] that \mathit{tw}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is \Theta\mathopen{}\mathclose{{}\left(q^{d}\big{/}\sqrt{d}}\right).
5.1 Proof that
Rossman [Ros08] proved that (recall 3.3), so it suffices to prove the upper bound. By Item 3.8(i) it suffices to prove that for an arbitrary . First we construct, by downwards induction, a sequence such that is an -element subset of and for all . The set satisfies this requirement because and are both equal to . Given , let be an -element subset of chosen uniformly at random. Each pair of elements in is included in with the same probability (), so by linearity of expectation,
[TABLE]
Therefore there exists a fixed such that .
Construct a union sequence for as follows: start by enumerating the edges, and then for from 1 to , append , where are the edges between and . Then,
[TABLE]
Finally, as observed in [Ros08], since is -regular it follows from Eq. 1 that
[TABLE]
5.2 Proof that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is if is Even
First we reduce to the case . The graph is a subgraph of Q_{d}^{\mathopen{}\mathclose{{}\left((q/2)^{d}}\right)} (recall 4.1), as evidenced by the following argument. Let and such that is a bijection from to , and let be another arbitrary bijection. Then the following map is an injective homomorphism from to Q_{d}^{\mathopen{}\mathclose{{}\left((q/2)^{d}}\right)}:
[TABLE]
By Items 3.8(iii) and 4.3, if is then
[TABLE]
Now we prove that is , following some brief definitions and a high-level overview of the argument. Fix . We identify each with . For let . Recall that is a threshold weighting on (3.3). Let .
Remark*.*
The intuition behind is as follows. The reader may note that , by reasoning analogous to that in Section 5.1. That is, for each vertex of in increasing lexicographic order, add to an accumulator all edges for which .
There is another union sequence captured by as well. If a subgraph is isomorphic to for some , then since is isomorphic to (and is uniform) it follows that . Consider a depth- binary tree in which each node at depth is a subgraph of isomorphic to (in particular, the root is and the leaves are vertices), and each interior node is the union of its two children along with some additional edges corresponding to a coordinate cut. This tree describes a union sequence for : recursively obtain the graphs and corresponding to the children of , and then take and add the missing edges. Note that .
Analogous to Section 5.1, the upper bound is obtained by comparing to for each , and bounding . For this purpose we will consider the two union sequences mentioned above, as well as hybrids of them.
The proof is as follows:
[TABLE]
For each threshold weighting , it will be convenient in the following to generalize to subgraphs by . (This is a nontrivial generalization of the definition of , because if then the restriction of to subgraphs of is not a threshold weighting on .) Also if is a single-vertex graph or the empty graph then let .
Lemma 5.1**.**
Let and be such that divides . Let be such that and , and . Then .
Proof.
The proof is by induction on . The inductive hypothesis will actually be (slightly) stronger in the following way: given a labeling of the vertices of with the elements of , the labels can be rearranged according to any of the isomorphisms of , and the inductive hypothesis is required to hold with respect to any such labeling. (The value of doesn’t depend on the labeling used in its definition because of the symmetry of .)
Let = . Since divides , it follows that is isomorphic to . In the inductive step we handle separately the cases where and . The base case is a special case of the former because if is a single vertex then and are both zero.
Case 1: . If then ; we now obtain the same result in the case where . For and let . Choose and independently and uniformly at random. By symmetry, each is in with the same probability . (Specifically, : For any edge , there is a unique index in which and differ. If then exactly one of and is in ; otherwise is in with probability 1/2 depending on .) By linearity of expectation,
[TABLE]
Similarly, . By our assumption that ,
[TABLE]
Therefore there exist fixed and such that .
Now our claim that follows from two applications of the inductive hypothesis. Since we required the inductive hypothesis to hold for all labelings of , we can assume without loss of generality that and . Ignoring , an application of the inductive hypothesis with and reveals that , and then a second application of the inductive hypothesis with and reveals that .
Let be an optimal (with respect to ) union sequence for , followed by an optimal union sequence for , followed by , where the are the edges between and in .333It is also necessary to add each edge individually to the union sequence, but clearly if , and if then the lemma holds trivially. (If or lacks edges then omit certain graphs from this sequence.) Then,
[TABLE]
We proceed to bound each of these three terms by , completing the proof. We have assumed that , and proved that . We have also assumed that , and since and both evaluate to 1 on all vertices, it follows that (with the last step following from the definition of ). Similarly, since is isomorphic to it follows that . Therefore .
Case 2: . For and let (where is defined as above). Choose and independently and uniformly at random. Note that .444Recall from 3.1 that . By reasoning similar to that in the previous case (and applying our various assumptions),
[TABLE]
Therefore for some fixed and .
Assume without loss of generality that and ; then . Applying the inductive hypothesis with and reveals that , and then applying the inductive hypothesis with and reveals that . ∎
Lemma 5.2**.**
.
Proof.
For any , it follows from Eq. 1 that , so it suffices to prove that for all . Let . Since
[TABLE]
(as can be seen by applying the automorphism to ), we can restrict our search to . In that case,
[TABLE]
By the same reasoning,
[TABLE]
Since (consider a similar automorphism), it follows that if then
[TABLE]
Therefore we can restrict our search to , in which case
[TABLE]
By induction it follows that . ∎
Remark*.*
Harper [Har04] proved that out of all subgraphs of with vertices, has the fewest outgoing edges [Fil15].
5.3 Proof that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is and \mathit{emb}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is
Alon and Marx [AM11, Theorem 4.3] proved that \mathit{emb}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is , and it follows from Theorem 1.5 that \mathit{emb}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)\leq O\mathopen{}\mathclose{{}\left(\kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)}\right)\leq O(q^{d}/d). Therefore \mathit{emb}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is .
It is implicit in the above argument that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)\geq\Omega\mathopen{}\mathclose{{}\left(\mathit{emb}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)}\right)\geq\Omega(q^{d}/d); we now present an alternate proof that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is based on edge expansion. Since is -regular, by Item 3.9(ii) it suffices to prove that 1-\lambda_{2}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)/d(q-1) is . We use the following well-known fact, where graphs are identified with their adjacency matrices:
Fact 5.3**.**
The eigenvalues of are for .
Proof.
Observe that , where the symbols and denote the tensor product and the identity matrix respectively. Let (resp. ) be the ’th eigenvector of (resp. ); clearly is an eigenvector of with eigenvalue . Since a real symmetric matrix (in particular or ) has an orthogonal eigenbasis, it follows that the are also orthogonal. Since , there are no other eigenvalues of . ∎
Since equals if and otherwise, repeated application of Fact 5.3 reveals that \lambda_{2}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)=(q-1)(d-1)-1=d(q-1)-q, so 1-\lambda_{2}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)/d(q-1)=q/(q-1)d, as desired.
Remark*.*
Fact 2.4 is an equality in the case of hypercubes (see e.g. [HLW06]): let and define a cut by partitioning the vertices according to the values of their ’th coordinates. So for hypercubes, all slack in the application of Item 3.9(ii) comes from Item 3.8(ii).
5.4 Proof that \mathit{tw}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is \Theta\mathopen{}\mathclose{{}\left(q^{d}/\sqrt{d}}\right), Summarized
(See [CK06] for the full proof.) The proof that \mathit{tw}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is O\mathopen{}\mathclose{{}\left(q^{d}\big{/}\sqrt{d}}\right) reduces to the case by reasoning analogous to that in the beginning of Section 5.2. For let be the set of vertices of with exactly or ones. The path is a tree decomposition of with width approximately , and by Stirling’s approximation this is \Theta\mathopen{}\mathclose{{}\left(2^{d}\big{/}\sqrt{d}}\right).555Compared to the tree decomposition from [CK06], this one is a simpler variant whose width is larger by up to a constant factor.
For a graph let be the minimum over all of the number of vertices in with at least one neighbor in . From a result of Robertson and Seymour [RS86] it follows that , and from a result of Harper [Har99] it follows that \phi\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is \Omega\mathopen{}\mathclose{{}\left(q^{d}\big{/}\sqrt{d}}\right). (Also note the parallels between and Item 3.8(ii); interestingly, we’ve sign that both are tight to within a constant factor in the case of .)
6 Upper Bound
An circuit is a constant-depth circuit with unbounded-fanin AND and OR gates and NOT gates. Fix a graph and threshold weighting for the remainder of this section. We prove the following, which is a more precise statement of Theorem 1.3:
Theorem 6.1**.**
There exists an circuit with wires that solves \text{G\mathsf{SUB}}(\mathbf{X}_{\Delta,n}) with probability , where is an absolute constant.
Since in any circuit the number of gates is at most one plus the number of wires, the circuit from Theorem 6.1 has size . (In this discussion, all terms in an exponent are independent of .) For comparison, it was proved in [LRR17] (building on a line of previous work [Ros08, Ama10, Ros10, NW11]) that the average-case complexity of \text{G\mathsf{SUB}}(\mathbf{X}_{\Delta,n}) is between and . Another related result, regarding the uncolored -clique problem, is that the average-case complexity of \text{K_{k}\mathsf{SUB}_{\mathrm{uncol}}}\mathopen{}\mathclose{{}\left(\mathbf{ER}\mathopen{}\mathclose{{}\left(n,n^{-2/(k-1)}}\right)}\right) is at most [Ama10, Ros14] ( by Section 5.1). See [Ros18] for a survey of the average-case circuit complexity of subgraph isomorphism more generally.
One challenge to implementing the algorithm behind Theorem 3.7 in is that sorting cannot be done in (polynomial-size) [Hås86]. The -size circuit from [LRR17] computes by finding the relevant pairs in by brute force with gates. Our circuit differs in that we represent as a depth- tree, where the non-root vertices are assigned labels in , and the (sequences of labels along the) paths from the root to the leaves correspond to the elements of . This will allow us to compute with high probability given and , on a circuit of size nearly linear in . A key fact in our construction is that circuits can (with high probability) convert between representations of corresponding to different orderings of .
Our construction requires fairly precise estimates for how many children to assign each node. Luckily this number is highly concentrated around its mean if the input graph is . This result will follow from the concentration inequality below, whose statement requires several definitions:
Definition 6.2**.**
Let be in the support of , and let be an arbitrary graph (which we think of as a “universe”). Let be the set of all possible elements of ; note that this can be identified with . If and then let extend to in if there exists a graph (called a -extension of ) such that . (In context, or will be implicit.) Equivalently, could be required to be in rather than in the latter definition.
Let . Let be good if for all graphs and , and for all and vertices , there are \tilde{O}\mathopen{}\mathclose{{}\left(n^{\Delta^{*}_{U}(A\cup v)-\Delta^{*}_{U}(A)}}\right) values of such that extends to . (Recall our unconventional definition of from Section 2, e.g. denotes .) Finally, let an event occur with high probability (w.h.p.) if it occurs with probability .
We prove the following:
Theorem 6.3**.**
The graph is good w.h.p.
Observe that this is a substantially stronger concentration bound than the application of Markov’s Inequality in the proof of Theorem 3.7. In Section 6.1 we prove Theorem 6.3, and then in Section 6.2 we use this result to prove Theorem 6.1. Both proofs use the following concentration inequality, which is proved by a Chernoff bound:
Lemma 6.4**.**
If is a sum of independent Bernoulli random variables then w.h.p. .
Proof.
Let be a decomposition of as a sum of independent Bernoulli random variables. Let and . Then for ,
[TABLE]
Letting gives assuming , and then (for example) letting gives, for sufficiently large ,
[TABLE]
6.1 Proof of Theorem 6.3
First we derive some algebraic properties of the threshold weighting .
Lemma 6.5**.**
If then .
Proof.
Each vertex or edge in one (resp. two) of and is also in one (resp. two) of and . ∎
Definition 6.6**.**
For let , and let be a -base if .
Throughout this subsection, will be an arbitrary subgraph of unless additional structure is imposed on it, and missing subscripts on and default to .
Lemma 6.7**.**
If then and .
Proof.
It suffices to show that the set is closed under intersection. Let . By the definition of , Lemma 6.5, and the fact that ,
[TABLE]
so . On the other hand, because . Therefore , so . ∎
Lemma 6.8**.**
If then is a -base.
Proof.
Since the interval includes the interval , it follows from Lemma 6.7 that . Therefore . ∎
Lemma 6.9**.**
If then .
Proof.
Since it follows that , so by Lemmas 6.7 and 6.5,
[TABLE]
Therefore . On the other hand, since and it follows that , so . Therefore , so it follows from the definition of that . ∎
We now analyze the concentration of , making liberal use of the fact that if events occur with uniformly high probability then their conjunction also occurs w.h.p. by a union bound. For the rest of this subsection, “extensions” are with respect to an implicit .
Lemma 6.10**.**
If and (i.e. for all ) then the number of -extensions of any is w.h.p.
(The above conditions are equivalent because, by the definition of , we have if and only if is the unique that minimizes .)
Proof.
The result is trivial for ; assume it’s true for all and that . (Since for all , for any it is the case that for all .) Assume without loss of generality that , since all -extensions of are also -extensions of ’s unique possible -extension. Also condition on , since otherwise trivially has zero -extensions.
There are possible -extensions of , so there are at most sets of possible -extensions of whose projections onto are pairwise vertex-disjoint. (This is true even if we omit the condition about vertex-disjointness.) For each of these sets, all of its elements are subgraphs of with probability , so this occurs for at least one such set with probability at most (by a union bound). By assumption, , so w.h.p. any set of -extensions of whose projections onto are pairwise vertex-disjoint has elements.
Let be one such set, such that is maximal. It follows that every -extension of agrees with some element of on some vertex in . Therefore has at most -extensions, where is the number of -extensions of that agree with on precisely . By the inductive hypothesis, is w.h.p. for all and (independent of ), so has -extensions w.h.p. by a union bound. ∎
Lemma 6.11**.**
If is a -base then any has -extensions w.h.p.
Proof.
Again, assume that is an induced subgraph of and condition on . Also assume without loss of generality that is strictly positive on . The proof is by induction on , for all . The base case is trivial. Fix an arbitrary vertex . First we consider the case where . The number of -extensions of equals the sum over all \gamma\in\{\text{\Gamma(A\cup v)\mathcal{A}}\} of the number of -extensions of . Clearly is a -base, and Lemma 6.8 implies that is a -base. It follows from our assumptions that , so we can apply the inductive hypothesis twice: w.h.p. has extensions to , each of which has extensions to , and the result follows.
Now assume that . Lemma 6.10 implies that has -extensions w.h.p. for any , so it suffices to show that w.h.p. there are values of such that extends to . Let , and if has \tilde{O}\mathopen{}\mathclose{{}\left(n^{\Delta(U-v)-\Delta(A)}}\right) extensions to when then let be “okay”. Since is a -base, is okay w.h.p. by the inductive hypothesis. Let \mathbf{Z}_{i}=\mathbbm{1}\!\{\text{\mathcal{A}\cup v_{i}U}\}, and let be the event that . Then,
[TABLE]
so it suffices to prove that for all okay .
The are independent Bernoulli random variables (given ). By a union bound, is at most the number of -extensions of times the probability that the requisite edges between any one of them and are in , i.e.
[TABLE]
Since is a -base, , so it follows from Lemma 6.4 that is w.h.p. ∎
Remark*.*
It follows from Lemma C.1 that Lemma 6.11 is essentially tight.
Now we prove that is good w.h.p.:
Proof of Theorem 6.3.
Let , and . By a union bound it suffices to prove that w.h.p. there are values of such that extends to . The number of such is at most the number of such that extends to , which is at most the number of -extensions of . Since (Lemma 6.9), this equals the sum over all \gamma\in\{\text{\Gamma(A)\mathcal{A}}\} of the number of -extensions of .
It follows from Lemma 6.10 that has extensions to w.h.p. (To see this, note that if then (Lemma 6.7), and if then it follows from the definition of that , a contradiction.) Since is a -base (Lemma 6.8), it follows from Lemma 6.11 that any is w.h.p. ( by Lemma 6.7). ∎
6.2 The Circuit
If is a data structure then let denote the number of bits used to represent it according to whatever schema we describe. If is a bit array and is a bit then let and for all . When there is a null element we represent it by the all-zeros string.
We now prove Theorem 6.1, i.e. that there exists an circuit with wires that solves \text{G\mathsf{SUB}}(\mathbf{X}_{\Delta,n}) w.h.p. Since is good w.h.p. (Theorem 6.3) it suffices to prove the existence of a small circuit such that P_{X\sim\mathbf{X}_{\Delta,n}}(\mathsf{C}(X)=\text{G\mathsf{SUB}}(X)\mid\text{X is good})=1-n^{-\omega(1)}. By Yao’s Principle [Yao77] it suffices to prove the existence of a small, random circuit such that P(\mathbf{C}(X)=\text{G\mathsf{SUB}}(X))=1-n^{-\omega(1)} for all fixed good . More precisely,
[TABLE]
The following result is essentially implicit in [LRR17] (as is the argument above) and helps keep the random circuit small:
Lemma 6.12** (Random Hashing).**
Let be a set containing a null element, and assume all elements of are represented using the same number of bits. Let and be functions of . Then there exists a random circuit such that if is an array of values in , of which all but at most are null, then has at most gates and wires, and w.h.p. the multiset of non-null elements of is the same as that of .
We remark that Lemma 6.12 will only be called with .
Proof.
The result is trivial if (simply return ) so assume otherwise. Let be a uniform random function. Let be an array of values in , where is the ’th non-null element of if this set has at least elements, and is null otherwise. Each of the at most non-null elements of is independently in with probability , so for any particular , the sub-array is large enough to store the non-null elements of w.h.p. (Lemma 6.4). It follows from a union bound that has the same non-null elements as w.h.p. Also assume that is for all ; this occurs w.h.p. by Lemma 6.4. Under these conditions it suffices to compute , and this can be done as follows.
For let T_{k}^{N}(x)=\mathbbm{1}\!\{\text{xk ones}\}. Then can be computed by applying a single OR gate to all elements of , and
[TABLE]
Fact 6.13** ([Hås+94, Theorem 6]).**
If for constant , then can be computed for by monotone unbounded fan-in circuits of depth with gates, where , and wires.
Let , and let be a constant such that the dimensions of are at most where . Let be the -size (hence -size) circuit from Fact 6.13 that computes . Observe that where is an arbitrary fixed string with exactly ones that can be hard-coded in. Therefore can be computed by an circuit of size \sum_{i\in[|\mathbf{h}^{-1}(p)|]}\mathopen{}\mathclose{{}\left(n^{o(1)}+|\mathbf{A}^{(p)}[i]|}\right)\leq\mathopen{}\mathclose{{}\left|\mathbf{A}^{(p)}}\right|n^{o(1)}. Summing over and , the total number of gates is . To count wires instead of gates, replace with . ∎
Given and an ordering of , we can represent as a tree in the following way. Start with a rooted, depth- tree (meaning the root has depth 0 and the leaves have depth ) in which each interior node has unordered children labeled . Then take the induced subtree of this tree on the union of all root-to-leaf paths such that666Recall that is a -colored vertex in . are the vertices of an -colored subgraph of .
With respect to an implicit and , let for , and let for .
Lemma 6.14**.**
* for all .*
Proof.
Clearly . Let such that and . Then . ∎
Let be a depth- tree in which each node at depth has children, where is a sufficiently large constant. Each non-root node has a label , and the root is labeled “root”. It is required that if we ignore the null nodes of , then is isomorphic to the tree representation of described above.
If the underlying tree structure of (that is, everything except the labels) is implicit, then we can represent by an array of values in , indexed by the nodes of . Each of these values can be associated with a bit string in a natural way. We will consider circuits that compute according to this representation.
Let be an immediate subtree of (resp. of a node ), denoted (resp. ), if ’s root is a child of ’s root (resp. of ). Any subtree is considered to have the same label as its root.
Lemma 6.15**.**
* is .*
Proof.
and . It takes bits to store an element of , and each is nonnegative (Lemma 6.14), so
[TABLE]
Lemma 6.16**.**
For all and orderings of , there exists a random circuit, independent of , with wires, that computes from w.h.p.
Proof.
Assume that and differ only in positions and . (The general case can be reduced to at most copies of this circuit in succession.) Define and analogously to and , but with respect to rather than . Clearly for , so for .
For each depth- node of , in parallel, do the following. For let \tau^{\prime}_{\sigma j}=\bigvee_{\tau\in\sigma}\mathopen{}\mathclose{{}\left((\mathcal{L}(\tau)=j)\wedge\tau^{(\mathcal{L}(\sigma))}}\right), where is formed from by replacing its (root’s) label with . Let be the tree whose immediate subtrees are for , and whose label is \mathopen{}\mathclose{{}\left(\bigvee_{\sigma\in N}\bigvee_{\tau\in\sigma}(\mathcal{L}(\tau)=j)}\right)\wedge\overline{j} where is the bit-string representation of . Hash the number of immediate subtrees of down to for each in parallel, and hash the number of down to . (The hashing uses Lemma 6.12 and succeeds w.h.p. because is good; also note that .) Finally, the new children of are the remaining .
Computing takes wires, so computing takes wires, and doing this for all and takes wires (Lemma 6.15). The hashing increases the number of wires by a factor of . ∎
For we can construct as follows. Suppose we’re given the adjacency matrix such that . Let . Let be the tree with children for , and label \mathopen{}\mathclose{{}\left(\bigvee_{i}A_{ij}}\right)\wedge\overline{j}. This setup is equivalent to the situation immediately before the hashing in the proof of Lemma 6.16, and the rest of the construction is the same. This takes wires, including the hashing.
Lemma 6.17**.**
For all and orderings and of and respectively, there exists a random circuit, independent of , with wires, that computes from and w.h.p. for some .
Proof.
Let and . By Lemma 6.16 we can assume without loss of generality that , and that for . Define and with respect to and respectively.
Let . For let be a depth- tree in which each node at depth (including ) has children. Again, each non-root node of has a label , and the root is labeled “root”. It is required that if we ignore null nodes, then is isomorphic to the intersection of the depth- truncations of and . Furthermore, each leaf of is associated with the pair of subtrees of and such that the -to- path in , the -to- path in , and the -to- path in are all the same sequence of labels.
The tree is the single node , and we can compute from by doing the following for each leaf of in parallel. Assume without loss of generality that . (If , reverse the roles of and in the following construction.) For let be the immediate subtree of with the same label as (if this exists), i.e. . Replace with a new node with children for all . Assign the node replacing the same label as , and assign the same label as and .
Computing takes \tilde{O}\mathopen{}\mathclose{{}\left(\sum_{\tau^{\prime}\in\sigma^{\prime}}|\tau^{\prime}|}\right)=\tilde{O}(|\sigma^{\prime}|) wires, and there are at most values of (Lemma 6.14), so computing takes wires. Given , computing the leaves of the replacement for takes O\mathopen{}\mathclose{{}\left(\sum_{\tau\in\sigma}(|\tau|+|\rho_{\tau}|)}\right)=O(|\sigma|+|\sigma^{\prime}|) wires. Since the roles of and might be reversed above, all of this takes at most wires. Since has \tilde{O}\mathopen{}\mathclose{{}\left(n^{\sum_{i<d}\psi_{i}}}\right) leaves, the number of wires is at most
[TABLE]
Let . For from down to 0, for each depth- node in , hash (Lemma 6.12) the number of immediate subtrees of down from to , and if all of ’s children are null and then set to null. (We remark that by Lemma C.3.) This takes wires (Lemma 6.15). By induction on , a node retains its label if and only if it should retain its label in , so the hashing succeeds w.h.p. because is good.
Finally, for each leaf of , append a copy of to each leaf of , and put this in place of in . This operation is purely semantic and requires no wires. The resulting tree does in fact have the proper number of children per node to be by Lemma C.4,777For apply Lemma C.4 with , and for apply Lemma C.4 with . but without this knowledge we could instead use hashing on and as above, without knowing whether or not it succeeds vacuously. ∎
For each successive in an optimal union sequence, compute as described above, and then apply a single OR gate to all leaves of .
Acknowledgments
Thanks to Benjamin Rossman for introducing me to this topic, and for having many helpful discussions about the research and about drafts of this paper. Thanks to Henry Yuen and the anonymous reviewers for their feedback as well. Part of this work was done while the author was visiting the Simons Institute for the Theory of Computing.
Appendix A Equivalence of Threshold Weightings and Markov Chains
Theorem A.1**.**
For any threshold weighting there exists a function such that
* for all ,* 2. 2.
* for all , and* 3. 3.
* for all .*
Proof.
Let . The proof is by induction on . If is a single vertex then consists only of , so setting satisfies the requirements. Now assume . For let (once is specified). Assume without loss of generality that is a clique, since we can assign on nonexistent edges.
Let , where ties are broken arbitrarily subject to being an induced subgraph of . Since ,
[TABLE]
so for we can define such that . For let , and let be the restriction of to subgraphs of . For any ,
[TABLE]
with equality if . Therefore is a threshold weighting on . Recursively define a restriction of to such that this restriction is a Markov Chain on that is equivalent to .
For let . For let , and let be the restriction of to subgraphs of . Then,
[TABLE]
For any , if then
[TABLE]
and if then . Therefore is a threshold weighting on . Recursively define a restriction of to such that this restriction is a Markov Chain on that is equivalent to .
We now verify that for all ; the other requirements follow easily by induction. If then by induction, and by the definition of . Similarly, if then and . Therefore for all . ∎
Appendix B Proof that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is for all
The proof below is self-contained; however in places with clear analogues in Section 5.2 we will give less detailed explanations of the intermediate steps and intuition.
Fix and . Let a query tree be a binary tree in which each node is labeled with some where each . The root is labeled with , each leaf is labeled with a singleton set, and for any interior node labeled with there exist and such that the left and right children of are labeled with and respectively. (In the latter case, necessarily has at least two elements.)
With respect to an implicit query tree , let be the leaves in increasing order from left to right, and for let . Let and let be the maximum of over all query trees . For a threshold weighting \Delta\in\theta\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) and let , and if is a single-vertex graph or the empty graph then let . By Item 3.8(i) it suffices to prove that \kappa_{\Delta}\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) is for all threshold weightings .
Lemma B.1**.**
Fix a query tree . Let such that are exactly the leaves descended from some node of . Let \Delta=(1,\beta)\in\theta\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right) such that and , and . Then .
Proof.
Let be the node of such that the leaves descended from are exactly . Let be the label of . Let .
The proof is by induction on , for all query trees . (It follows from the definitions that , with equality for all if and only if is a leaf.) In the inductive step we handle separately the cases where and . The base case is a special case of the former because if is a single vertex then and are both zero.
Case 1: . If is a single vertex then ; we now obtain the same result in the case where is not a single vertex. Let , and note that is nonempty. For and let . Choose a pair uniformly at random out of all pairs such that and . Each edge in is also in with the same probability (since adjacent vertices differ in a unique coordinate), so by linearity of expectation,
[TABLE]
Therefore for some fixed and .
Now our claim that follows from two applications of the inductive hypothesis. Let be any query tree in which the sequence of labels along the path from the root to the leftmost leaf includes followed by . With respect to , an application of the inductive hypothesis with and reveals that , and then an application of the inductive hypothesis with and () reveals that .
The rest of the proof is essentially identical to the case . Let be an optimal (with respect to ) union sequence for , followed by an optimal union sequence for , followed by , where the are the edges between and in . (If or lacks edges then omit certain graphs from this sequence.) Then,
[TABLE]
We proceed to bound each of these three terms by , completing the proof. We have assumed that , and proved that . We have also assumed that , and since and both evaluate to 1 on all vertices, it follows that (with the last step following from the definition of ). Similarly, , and it follows that .
Case 2: . For and let (where and are defined as above). Choose a pair uniformly at random out of all pairs such that and . Then there exist (specifically, , , and ) such that
[TABLE]
Therefore for some fixed and .
Preparing to apply the inductive hypothesis, let be any query tree structured and labeled exactly like on all ancestors of for all , and on all ancestors of , but now the left child of is labeled with . With respect to , an application of the inductive hypothesis with and reveals that , and a second application of the inductive hypothesis with and () reveals that . ∎
Lemma B.2**.**
* is .*
Proof.
We use a cruder bound here than in the case . Let be an arbitrary query tree and . Let be the nearest common ancestor of and . If is a leaf then clearly is , so assume otherwise. Let and be the left and right children of , and note that is a descendant of . By Eq. 1, since is -regular it suffices to prove that is . Suppose is labeled with and is labeled with . Since each vertex in is a descendant of , all edges between and are in one of the following classes:
Edges (in ) between a leaf descended from and a leaf not descended from . Each leaf descended from is adjacent to leaves not descended from , so this amounts to edges in total. By the AM-GM inequality, this is at most
[TABLE] 2. 2.
Edges (in ) between a leaf descended from and a leaf descended from . Each leaf descended from is adjacent to one leaf descended from , so this amounts to at most edges in total. 3. 3.
Edges (in ) between a leaf descended from that’s in , and a leaf descended from that’s in . This is at most what the value of would be if were instead. (Eliminate coordinate , and replace with for all .)
The total number of edges in all classes is therefore . ∎
Finally, it follows from Lemmas B.1 and B.2 that \kappa\mathopen{}\mathclose{{}\left(K_{q}^{d}}\right)\leq 2\mu\leq O(q^{d}/d).
Remark*.*
The above argument holds even if we relax the definition of threshold weightings to allow to take on negative values (where all definitions in terms of threshold weightings are with respect to this revised definition).
Appendix C Properties of Threshold Weightings and Threshold Random Graphs
Lemma C.1**.**
If are fixed graphs, , and for all , then conditional on , there are at least -extensions of a.a.s.
Li et al. [LRR17] stated without proof that a similar result can be obtained using Janson’s Inequality [Jan90]:
Fact C.2** (Janson’s Inequality).**
Let be independent Bernoulli random variables, let , and for let . Also for let if . Let and . Then for all ,
[TABLE]
Proof of Lemma C.1.
Let be the possible -extensions of , and let . Define as in Fact C.2; clearly , by reasoning similar to the proof of Lemma 3.5. If then the projection of onto must be some graph in , so
[TABLE]
Since is also , it follows that \mu^{2}\big{/}\mathopen{}\mathclose{{}\left(\mu+\sum_{i\sim j}\mathbb{E}[\mathbf{I}_{i}\mathbf{I}_{j}]}\right)\geq\mu^{2}/o(\mu^{2})=\omega(1), and the result follows from Fact C.2. ∎
Lemma C.3**.**
For all and ,
[TABLE]
Proof.
Since it follows that , and since and it follows that . So by Lemmas 6.5 and 6.7,
[TABLE]
Lemma C.4**.**
Let and assume and . Then, .
Proof.
For all and ,
[TABLE]
so by Lemma 6.5,
[TABLE]
The same reasoning applies with in place of , so
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AM 11] Noga Alon and Dániel Marx “Sparse balanced partitions and the complexity of subgraph problems” In SIAM J. Discrete Math. 25.2 , 2011, pp. 631–644 DOI: 10.1137/100812653 · doi ↗
- 2[AM 85] Noga Alon and Vitali D. Milman “ λ 1 , subscript 𝜆 1 \lambda_{1}, isoperimetric inequalities for graphs, and superconcentrators” In J. Combin. Theory Ser. B 38.1 , 1985, pp. 73–88 DOI: 10.1016/0095-8956(85)90092-9 · doi ↗
- 3[Ama 10] Kazuyuki Amano “ k 𝑘 k -subgraph isomorphism on AC 0 superscript AC 0 \rm AC^{0} circuits” In Comput. Complexity 19.2 , 2010, pp. 183–210 DOI: 10.1007/s 00037-010-0288-y · doi ↗
- 4[AYZ 95] Noga Alon, Raphael Yuster and Uri Zwick “Color-coding” In J. ACM 42.4 , 1995, pp. 844–856 DOI: 10.1145/210332.210337 · doi ↗
- 5[BK 08] Hans L. Bodlaender and Arie M… Koster “Combinatorial optimization on graphs of bounded treewidth” In Comput. J. 51.3 OUP, 2008, pp. 255–269 DOI: 10.1093/comjnl/bxm 037 · doi ↗
- 6[Bod 98] Hans L. Bodlaender “A partial k 𝑘 k -arboretum of graphs with bounded treewidth” In Theoret. Comput. Sci. 209.1-2 , 1998, pp. 1–45 DOI: 10.1016/S 0304-3975(97)00228-4 · doi ↗
- 7[CK 06] L. Chandran and Telikepalli Kavitha “The treewidth and pathwidth of hypercubes” In Discrete Math. 306.3 , 2006, pp. 359–365 DOI: 10.1016/j.disc.2005.12.011 · doi ↗
- 8[EG 04] Friedrich Eisenbrand and Fabrizio Grandoni “On the complexity of fixed parameter clique and dominating set” In Theoret. Comput. Sci. 326.1-3 , 2004, pp. 57–67 DOI: 10.1016/j.tcs.2004.05.009 · doi ↗
