Beating Treewidth for Average-Case Subgraph Isomorphism

Gregory Rosenthal

arXiv:1902.06380·cs.CC·November 4, 2020

TL;DR

This paper demonstrates that for certain graphs like hypercubes, the average-case complexity of the subgraph isomorphism problem can be significantly lower than the worst-case treewidth-based bounds, by analyzing related graph parameters.

Contribution

It proves that $\mathit{emb}(G)$ is $O(\kappa(G))$ for all graphs $G$, shows that $\kappa(G)$ can be asymptotically less than the treewidth $\mathit{tw}(G)$ (for example when $G$ is a hypercube), and constructs $\mathrm{AC}^{0}$ circuits of size $O(n^{\kappa(G)+\mathrm{const}})$ that solve $G$-$\mathsf{SUB}$ in the average case.

Findings

01

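If $G$ is a hypercube, the average-case complexity of $G$-$\mathsf{SUB}$ is $n^{o(\mathit{tw}(G))}$: the exponent is sublinear in the treewidth.

02

$\mathit{emb}(G)$ is $O(\kappa(G))$ for every graph $G$.

03

The constructed $\mathrm{AC}^{0}$ circuits have size $O(n^{\kappa(G)+\mathrm{const}})$, closing the gap between the average-case upper and lower bounds of Li, Razborov and Rossman.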

Abstract

For any fixed graph $G$, the subgraph isomorphism problem asks whether an $n$-vertex input graph has a subgraph isomorphic to $G$. A well-known algorithm of Alon, Yuster and Zwick (1995) efficiently reduces this to the "colored" version of the problem, denoted $G$-$\mathsf{SUB}$, and then solves $G$-$\mathsf{SUB}$ in time $O(n^{\mathit{tw}(G)+1})$ where $\mathit{tw}(G)$ is the treewidth of $G$. Marx (2010) conjectured that $G$-$\mathsf{SUB}$ requires time $\Omega(n^{\mathrm{const}\cdot\mathit{tw}(G)})$ and, assuming the Exponential Time Hypothesis, proved a lower bound of $\Omega(n^{\mathrm{const}\cdot\mathit{emb}(G)})$ for a certain graph parameter $\mathit{emb}(G)\geq\Omega(\mathit{tw}(G)/\log\mathit{tw}(G))$. With respect to the size of $\mathrm{AC}^{0}$ circuits solving $G$-$\mathsf{SUB}$ in the average case, Li, Razborov and Rossman (2017) proved (unconditional) upper and lower bounds of $O(n^{2\kappa(G)+\mathrm{const}})$ and…

Equations

$$h(G)=\min_{\emptyset\subset U\subset V(G)}\frac{e(U,V(G)-U)}{\min(|U|,|V(G)-U|)}.$$

$$0\leq\Delta(uv)=\alpha(u)+\alpha(v)-\beta(uv)\leq 2-\beta(uv).$$

$$\Delta(H)=\sum_{\substack{v\in V(H)\\ uv\in E(G)-E(H)}}M_{u,v}\geq 0,$$

$$\alpha^{\prime}(u)$$

$$\beta^{\prime}(uv)$$

$$\kappa\left(G^{(q)}\right)\leq q\max(\kappa(G),2).$$

$$e(H)=\Theta(v(H))\leq O(\kappa(H))\leq O\left(\kappa\left(G^{(\lceil e(H)/r\rceil)}\right)\right)\leq O\left(\kappa(G)e(H)/r\right),$$

$$\mathbb{E}[\beta(K_{k}[U_{i-1}])]=\sum_{e\in E(K_{k}[U_{i}])}\beta(e)p_{i}=p_{i}\beta(K_{k}[U_{i}])\geq p_{i}\beta_{\mathrm{o}}(K_{k}[U_{i}])=\mathbb{E}[\beta_{\mathrm{o}}(K_{k}[U_{i-1}])].$$

$$\max_{H\in S}\Delta(H)\leq\max_{i}\Delta(K_{k}[U_{i}])+1\leq\max_{i}\Delta_{\mathrm{o}}(K_{k}[U_{i}])+1.$$

$$\Delta_{\mathrm{o}}(K_{k}[U_{i}])=\frac{i(k-i)}{k-1}\leq\frac{k^{2}}{4(k-1)}=\frac{1}{4}\left(k+1+\frac{1}{k-1}\right)\leq\frac{k+2}{4}=k/4+O(1).$$

$$(x_{1},\dots,x_{d})\mapsto((\phi_{L}(x_{1}),\dots,\phi_{L}(x_{d})),\psi(\phi_{R}(x_{1}),\dots,\phi_{R}(x_{d}))).$$

$$\kappa\left(K_{q}^{d}\right)\leq\kappa\left(Q_{d}^{((q/2)^{d})}\right)\leq O\left(\left(\frac{q}{2}\right)^{d}\kappa(Q_{d})\right)\leq O\left(\left(\frac{q}{2}\right)^{d}\frac{2^{d}}{d}\right)=O(q^{d}/d).$$

$$\begin{aligned}\kappa(Q_{d})&\leq 2\mu\\&<4/3\cdot 2^{d}/d.\end{aligned}$$

$$\mathbb{E}[\beta(B(i,b))]=\sum_{e\in E(B)}P(e\in B(i,b))\beta(e)=p\beta(B).$$

$$\mathbb{E}[\beta(B(i,b))]=p\beta(B)\geq p\beta_{\mathrm{o}}(B)=\mathbb{E}[\beta_{\mathrm{o}}(B(i,b))].$$

$$\max_{H\in S}\Delta(H)\leq\max(\kappa_{\Delta}(G(a)),\kappa_{\Delta}(B),\Delta(G(a))+\Delta(B)).$$

$$\begin{aligned}\mathbb{E}[\beta(H(i,b))]&=\frac{1}{2}\beta(G(a))+\frac{1}{2}\beta(G(a+2^{k}))-\frac{1}{2k}\beta(B)\\&>\frac{1}{2}\beta_{\mathrm{o}}(G(a))+\frac{1}{2}\beta_{\mathrm{o}}(G(a+2^{k}))-\frac{1}{2k}\beta_{\mathrm{o}}(B)\\&=\mathbb{E}[\beta_{\mathrm{o}}(H(i,b))].\end{aligned}$$

$$e(G(0,a),G(a,2^{d}))=e(G(0,2^{d}-a),G(2^{d}-a,2^{d}))$$

$$\begin{aligned}e(G(0,a),G(a,2^{d}))&=e(G(0,a),G(a,2^{d-1}))+v(G(0,a))\\&=e(G(0,a),G(a,2^{d-1}))+a.\end{aligned}$$

$$e(G(0,2^{d-1}-a),G(2^{d-1}-a,2^{d}))=e(G(0,2^{d-1}-a),G(2^{d-1}-a,2^{d-1}))+2^{d-1}-a.$$

$$e(G(0,2^{d-1}-a),G(2^{d-1}-a,2^{d}))-e(G(0,a),G(a,2^{d}))=2^{d-1}-2a>0.$$

$$\begin{aligned}e(G(0,a),G(a,2^{d}))&=e(G(2^{d-2},a),G(a,2^{d-1}))+e(G(0,2^{d-2}),G(a,2^{d-1}))+a\\&=e(G(2^{d-2},a),G(a,2^{d-1}))+2^{d-1}.\end{aligned}$$

$$P(S\geq r)\leq\exp(-tr)\prod_{i}\exp(p_{i}e^{t})=\exp(-tr+\mu e^{t}).$$

$$P(S\geq r)\leq(e/\log^{2}n)^{\log^{2}n}\leq(1/e)^{\log^{2}n}=n^{-\log n}.$$

$$2\Delta^{*}(A)=\Delta(B)+\Delta(C)=\Delta(B\cap C)+\Delta(B\cup C)\geq\Delta(B\cap C)+\Delta^{*}(A),$$

Full text

Beating Treewidth for Average-Case Subgraph Isomorphism

Gregory Rosenthal

University of Toronto Email: [email protected]. Supported by NSERC (PGS D).

Abstract

For any fixed graph $G$ , the subgraph isomorphism problem asks whether an $n$ -vertex input graph has a subgraph isomorphic to $G$ . A well-known algorithm of Alon, Yuster and Zwick (1995) efficiently reduces this to the “colored” version of the problem, denoted $G$ - $\mathsf{SUB}$ , and then solves $G$ - $\mathsf{SUB}$ in time $O(n^{\mathit{tw}(G)+1})$ where $\mathit{tw}(G)$ is the treewidth of $G$ . Marx (2010) conjectured that $G$ - $\mathsf{SUB}$ requires time $\Omega(n^{\mathrm{const}\cdot\mathit{tw}(G)})$ and, assuming the Exponential Time Hypothesis, proved a lower bound of $\Omega(n^{\mathrm{const}\cdot\mathit{emb}(G)})$ for a certain graph parameter $\mathit{emb}(G)\geq\Omega(\mathit{tw}(G)/\log\mathit{tw}(G))$ . With respect to the size of $\mathrm{AC}^{0}$ circuits solving $G$ - $\mathsf{SUB}$ in the average case, Li, Razborov and Rossman (2017) proved (unconditional) upper and lower bounds of $O(n^{2\kappa(G)+\mathrm{const}})$ and $\Omega(n^{\kappa(G)})$ for a different graph parameter $\kappa(G)\geq\Omega(\mathit{tw}(G)/\log\mathit{tw}(G))$ .

Our contributions are as follows. First, we prove that $\mathit{emb}(G)$ is $O(\kappa(G))$ for all graphs $G$. Next, we show that $\kappa(G)$ can be asymptotically less than $\mathit{tw}(G)$; for example, if $G$ is a hypercube then $\kappa(G)$ is $\Theta\left(\mathit{tw}(G)\big/\sqrt{\log\mathit{tw}(G)}\right)$. This implies that the average-case complexity of $G$-$\mathsf{SUB}$ is $n^{o(\mathit{tw}(G))}$ when $G$ is a hypercube. Finally, we construct $\mathrm{AC}^{0}$ circuits of size $O(n^{\kappa(G)+\mathrm{const}})$ that solve $G$-$\mathsf{SUB}$ in the average case, closing the gap between the upper and lower bounds of Li et al.

1 Introduction

The subgraph isomorphism problem asks, given graphs $X$ and $G$ , whether $X$ has a subgraph isomorphic to $G$ . In the “colored” or “partitioned” version of the problem, each vertex of the larger graph $X$ comes with a “color” from the vertex set of $G$ , and we ask whether $X$ has a subgraph that is isomorphic to $G$ with respect to this coloring. We denote the uncolored and colored subgraph isomorphism problems by $\text{$ G $-$ \mathsf{SUB}_{\mathrm{uncol}} $}(X)$ and $\text{$ G $-$ \mathsf{SUB} $}(X)$ respectively.

Subgraph isomorphism is NP-complete (e.g. if $G$ is a clique or Hamiltonian cycle), so research has focused on algorithms for a variety of special cases in the context of parameterized complexity, surveyed in [MP14]. If $G$ is a fixed graph on $k$ vertices then $G$ - $\mathsf{SUB}_{\mathrm{uncol}}$ is solvable in time $O(n^{k})$ by brute force, where (here and throughout this section) $n$ is the order of the input graph. The color-coding algorithm of Alon, Yuster and Zwick [AYZ95] improves on this by efficiently reducing $G$ - $\mathsf{SUB}_{\mathrm{uncol}}$ to $G$ - $\mathsf{SUB}$ and solving the latter in time $O(n^{\mathit{tw}(G)+1})$ , where $\mathit{tw}(G)$ is the treewidth of the fixed graph $G$ .

The exponent $\mathit{tw}(G)+1$ can sometimes be improved using fast matrix multiplication [NP85, EG04], but no significantly faster algorithm is known for either the colored or uncolored subgraph isomorphism problem. Marx [Mar10] conjectured the following:

Conjecture 1.1.

There is no class $\mathcal{G}$ of graphs with unbounded treewidth, no algorithm $\mathbb{A}$ that on inputs $G$ and $X$ solves $\text{$ G $-$ \mathsf{SUB} $}(X)$ , and no function $f$ such that if $G$ is in $\mathcal{G}$ then $\mathbb{A}$ runs in time $f(G)n^{o(\mathit{tw}(G))}$ .

Marx [Mar10] came close to proving Conjecture 1.1 assuming the Exponential Time Hypothesis (ETH) [IPZ01], which is the hypothesis that solving 3SAT on $n$ variables requires $2^{\Omega(n)}$ time. We state his result in terms of a parameter $\mathit{emb}(G)$ (short for “embedding”) which we will define in Section 4:

Theorem 1.2 ([Mar10]).

Assuming ETH, there is no class $\mathcal{G}$ of graphs with unbounded treewidth, no algorithm $\mathbb{A}$ that on inputs $G$ and $X$ solves $\text{$ G $-$ \mathsf{SUB} $}(X)$ , and no function $f$ such that if $G$ is in $\mathcal{G}$ then $\mathbb{A}$ runs in time $f(G)n^{o(\mathit{emb}(G))}$ .

Marx [Mar10] proved that $\mathit{emb}(G)$ is $\Omega(\mathit{tw}(G)/\log\mathit{tw}(G))$, so Theorem 1.2 comes within a logarithmic factor in the exponent of proving Conjecture 1.1 (under ETH). However, our results include a counterexample to an average-case analogue of Conjecture 1.1, in a sense that will be made precise in Section 3. Moreover, this counterexample holds in $\mathrm{AC}^{0}$, i.e. on unbounded-fanin boolean circuits of depth depending only on $G$.

Li, Razborov and Rossman [LRR17] proved that for fixed $G$, the average-case $\mathrm{AC}^{0}$ complexity of $G$-$\mathsf{SUB}$ is between $n^{\kappa(G)-o(1)}$ and $n^{2\kappa(G)+c}$, where $\kappa(G)$ is a graph property and $c$ is an absolute constant. (In [LRR17], the parameter $\kappa(G)$ was called $\kappa_{\mathrm{col}}(G)$. See Section 3 for Li et al.’s definition of $\kappa(G)$; we also prove that $\kappa(G)$ can be equivalently defined in terms of the transition matrix of a certain random walk on $G$.) We tighten this gap, answering a question posed in [LRR17]:

Theorem 1.3.

There is a constant $c>0$ such that for any fixed graph $G$ , the average-case $\mathrm{AC}^{0}$ complexity of $G$ - $\mathsf{SUB}$ is at most $n^{\kappa(G)+c}$ .

We observe that a similar result holds easily on Turing machines, using as a subroutine the sort-merge join algorithm from relational algebra. This involves sorting, which cannot be done in (polynomial-size) $\mathrm{AC}^{0}$ [Hås86], so our circuit instead uses hashing that relies on concentration of measure for subgraphs of random graphs.

Li et al. [LRR17] also proved that $\kappa(G)$ is between $\Omega(\mathit{tw}(G)/\log\mathit{tw}(G))$ and $\mathit{tw}(G)+1$, from which it follows that the worst-case complexity of $G$-$\mathsf{SUB}$ is at least $n^{\Omega(\mathit{tw}(G)/\log\mathit{tw}(G))}$ in $\mathrm{AC}^{0}$. Li et al. posed the question of whether $\kappa(G)$ is $\Theta(\mathit{tw}(G))$; an affirmative answer would have implied that Conjecture 1.1 holds in $\mathrm{AC}^{0}$.

However, the following example separates $\kappa$ from treewidth. The Hamming graph $K_{q}^{d}$ has vertex set $\{1,\dotsc,q\}^{d}$ and edges between every two vertices that differ in exactly one coordinate. It is already known that $K_{q}^{d}$ has treewidth $\Theta\left(q^{d}\big/\sqrt{d}\right)$ [CK06]. We prove the following:

Theorem 1.4.

$\kappa\left(K_{q}^{d}\right)$ is $\Theta(q^{d}/d)$.

Thus, if $G$ is the hypercube graph $K_{2}^{d}$ for example, then $\kappa(G)$ is $\Theta\left(\mathit{tw}(G)\big/\sqrt{\log\mathit{tw}(G)}\right)$. It follows that an average-case analogue of Conjecture 1.1 is false if $\mathcal{G}$ is taken to be the set of all hypercubes. We also prove the following (for arbitrary graphs $G$):

Theorem 1.5.

$\mathit{emb}(G)$ is $O(\kappa(G))$.

Because of Theorem 1.5, even if our upper bound generalizes to the worst case, it is still consistent with current knowledge (in particular Theorem 1.2) that ETH is true. Another consequence of Theorem 1.5 is that the lower bound from Theorem 1.2 holds unconditionally in $\mathrm{AC}^{0}$ .

It follows from Theorems 1.4 and 1.5 that if $G$ is a hypercube then $\mathit{emb}(G)\leq O(\kappa(G))\leq o(\mathit{tw}(G))$, so proving that Conjecture 1.1 holds under ETH cannot be done by proving that $\mathit{emb}(G)$ is $\Theta(\mathit{tw}(G))$. In fact, this conclusion was already known: Alon and Marx [AM11] proved that if $G$ is a 3-regular expander then $\mathit{emb}(G)$ is $\Theta(\mathit{tw}(G)/\log\mathit{tw}(G))$. Li et al. [LRR17] proved that if $G$ is a 3-regular expander then $\kappa(G)$ is $\Theta(\mathit{tw}(G))$, which makes our separation of $\kappa$ from treewidth more surprising. On the other hand, we will see that Theorem 1.5 is tight in the case of Hamming graphs.

We can make a similar statement regarding $\mathrm{AC}^{0}$. Amano [Ama10] observed that the color-coding algorithm for $G$-$\mathsf{SUB}$ can be implemented by $\mathrm{AC}^{0}$ circuits of size $O(n^{\mathit{tw}(G)+1})$ for fixed $G$. Our separation of $\kappa$ from treewidth implies that if Conjecture 1.1 holds in $\mathrm{AC}^{0}$, then this cannot be proved using average-case complexity as defined here and in [LRR17].

The paper is organized as follows. In Section 2 we introduce some notation and definitions. In Section 3 we define the average-case problem and $\kappa(G)$, and give an $\tilde{O}(n^{\kappa(G)})$-time algorithm for the average-case problem. In Section 4 we define $\mathit{emb}(G)$ and prove that $\mathit{emb}(G)$ is $O(\kappa(G))$. In Section 5 we prove that $\kappa\left(K_{q}^{d}\right)$ is $\Theta(q^{d}/d)$, and obtain as a corollary that $\mathit{emb}\left(K_{q}^{d}\right)$ is $\Theta(q^{d}/d)$ as well. We also summarize the proof of Chandran and Kavitha [CK06] that $\mathit{tw}\left(K_{q}^{d}\right)$ is $\Theta\left(q^{d}\big/\sqrt{d}\right)$. In Section 6 we prove our $\mathrm{AC}^{0}$ upper bound.

2 Preliminaries

It will be convenient to define $\tilde{O}(f(n))=f(n)\log^{O(1)}n$ . (This differs from the standard notation when $f(n)=n^{o(1)}$ .) We will often fix a graph $G$ , in which case the constants hidden in asymptotic notation are allowed to depend on $G$ .

We use boldface to denote random variables. The indicator variable $\mathbbm{1}\!\{E\}$ equals 1 if the event $E$ occurs and 0 otherwise. Expected value is denoted $\mathbb{E}[\cdot]$ . An event occurs asymptotically almost surely (a.a.s.) if it occurs with probability $1-o(1)$ as $n$ goes to infinity.

Let $[k]=\{1,\dotsc,k\}$ for $k\in\mathbb{N}$ . If a positive real number $x$ is used in a context where a natural number is expected (for example $[x]$ ), it’s because $x$ can be rounded arbitrarily to $\lceil x\rceil$ or $\lfloor x\rfloor$ without affecting the asymptotic behavior of whatever is being considered.

2.1 Graphs

All graphs we consider are simple and undirected, and may have isolated vertices. If $G$ is a graph then let $V(G)$ and $E(G)$ denote its vertex and edge sets, with respective cardinalities $v(G)$ and $e(G)$ . If $u$ and $v$ are adjacent vertices then we denote the edge connecting them by $uv$ or $vu$ . A graph $H$ is a subgraph of $G$ , denoted $H\subseteq G$ , if $V(H)\subseteq V(G)$ and $E(H)\subseteq E(G)$ .

Definition 2.1 (Colored subgraph isomorphism problem).

For graphs $G$ and $X$ , where $X$ comes with a coloring $\chi:V(X)\rightarrow V(G)$ , the problem $\text{$ G $-$ \mathsf{SUB} $}(X)$ asks whether $X$ has a subgraph $G^{\prime}$ such that $\chi$ (restricted to $V(G^{\prime})$ ) is an isomorphism from $G^{\prime}$ to $G$ .
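As an illustration of Definition 2.1, the following minimal sketch decides $G$-$\mathsf{SUB}(X)$ by brute force: it tries every way of picking one vertex of $X$ per color and checks that every edge of $G$ is realized. The function name `colored_sub` and the input encoding (lists of pairs, a dict for $\chi$) are assumptions made for this example, not notation from the paper, and the running time is the naive $O(n^{v(G)})$ rather than anything clever.

```python
from itertools import product


def colored_sub(g_vertices, g_edges, x_edges, chi):
    """Brute-force G-SUB(X).

    g_vertices: list of vertices of G
    g_edges:    list of pairs (u, v) of vertices of G
    x_edges:    list of pairs of vertices of X
    chi:        dict mapping each vertex of X to its color in V(G)
    """
    x_adj = {frozenset(e) for e in x_edges}
    # Color classes: the vertices of X that chi sends to each vertex of G.
    classes = [[x for x, color in chi.items() if color == u] for u in g_vertices]
    # Try every choice of one vertex of X per color.
    for choice in product(*classes):
        pick = dict(zip(g_vertices, choice))
        # The chosen vertices span a G-colored subgraph iff every edge of G is present.
        if all(frozenset((pick[u], pick[v])) in x_adj for u, v in g_edges):
            return True
    return False


triangle_vertices = ["a", "b", "c"]
triangle_edges = [("a", "b"), ("b", "c"), ("a", "c")]
chi = {"a1": "a", "a2": "a", "b1": "b", "c1": "c"}
x_edges = [("a1", "b1"), ("b1", "c1"), ("a2", "c1")]
print(colored_sub(triangle_vertices, triangle_edges, x_edges, chi))
print(colored_sub(triangle_vertices, triangle_edges, x_edges + [("a1", "c1")], chi))
```

In the first call no single vertex of color `a` is adjacent to both `b1` and `c1`, so there is no colored triangle; adding the edge `a1`-`c1` creates one.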

For $U\subseteq V(G)$ let $G[U]$ be the induced subgraph of $G$ on $U$ , and more generally let $G[U_{1},\dotsc,U_{k}]=G[U_{1}\cup\dotsb\cup U_{k}]$ . Let $G-U=G[V(G)-U]$ , and for $H\subseteq G$ let $G-H=G-V(H)$ .

When the parent graph $G$ is clear in context, let $\deg(u)$ be the degree of a vertex $u$ , and for disjoint $S,T\subseteq V(G)$ let $e(S,T)$ be the number of edges between $S$ and $T$ . Similarly, for vertex-disjoint graphs $A,B\subseteq G$ let $e(A,B)=e(V(A),V(B))$ .

Let $G\cap H$ be the graph with vertex set $V(G)\cap V(H)$ and edge set $E(G)\cap E(H)$ , and define $G\cup H$ similarly. Note that $G\cap H$ may have isolated vertices even if $G$ and $H$ do not. If $A\subseteq B$ are graphs then let $[A,B]=\{H\mid A\subseteq H\subseteq B\}$ , and let $(A,B]$ be the same interval without $A$ , etc.

The Cartesian product of graphs $G$ and $H$ , denoted $G\mathbin{\square}H$ , has vertex set $V(G)\times V(H)$ and edges $(u,v_{1})(u,v_{2})$ for all $u\in V(G)$ and $v_{1}v_{2}\in E(H)$ , and $(u_{1},v)(u_{2},v)$ for all $u_{1}u_{2}\in E(G)$ and $v\in V(H)$ . Let $G^{d}$ be the Cartesian product of $d$ copies of $G$ .

We denote by $K_{k}$ the complete graph on $k$ vertices, also called the $k$ -clique. It follows that $K_{q}^{d}$ has vertex set $[q]^{d}$ , and two vertices are adjacent if and only if they differ in exactly one coordinate. Such graphs are called Hamming graphs. A special case is the $d$ -dimensional hypercube $Q_{d}=K_{2}^{d}$ ; we will use $\{0,1\}^{d}$ for its vertex set.
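The two descriptions of Hamming graphs above (as a Cartesian power of $K_{q}$, and as tuples adjacent when they differ in exactly one coordinate) can be checked against each other on small cases. The sketch below is only illustrative; the helper names `complete_graph`, `cartesian_product` and `hamming_graph` are assumptions of this example.

```python
from itertools import combinations, product


def complete_graph(q):
    """K_q on the vertex set {1, ..., q}."""
    vertices = list(range(1, q + 1))
    return vertices, {frozenset(p) for p in combinations(vertices, 2)}


def cartesian_product(G, H):
    """Cartesian product of G and H, following the definition in Section 2.1."""
    (VG, EG), (VH, EH) = G, H
    vertices = [(u, v) for u in VG for v in VH]
    edges = set()
    for u in VG:  # edges (u, v1)(u, v2) for v1v2 in E(H)
        for e in EH:
            v1, v2 = tuple(e)
            edges.add(frozenset(((u, v1), (u, v2))))
    for e in EG:  # edges (u1, v)(u2, v) for u1u2 in E(G)
        u1, u2 = tuple(e)
        for v in VH:
            edges.add(frozenset(((u1, v), (u2, v))))
    return vertices, edges


def hamming_graph(q, d):
    """K_q^d: vertex set [q]^d, edges between tuples differing in exactly one coordinate."""
    vertices = list(product(range(1, q + 1), repeat=d))
    edges = {frozenset((x, y)) for x, y in combinations(vertices, 2)
             if sum(a != b for a, b in zip(x, y)) == 1}
    return vertices, edges


K3 = complete_graph(3)
prod_vertices, prod_edges = cartesian_product(K3, K3)
ham_vertices, ham_edges = hamming_graph(3, 2)
print(set(prod_vertices) == set(ham_vertices) and prod_edges == ham_edges)

cube_vertices, cube_edges = hamming_graph(2, 3)  # the hypercube Q_3, up to relabeling
print(len(cube_vertices), len(cube_edges))
```

Each vertex of $K_{q}^{d}$ has degree $d(q-1)$, so the graph has $q^{d}\cdot d(q-1)/2$ edges; the test values below use this count.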

Definition 2.2 (Graph minor).

A graph $H$ is a minor of a graph $G$ if there exists a minor mapping $\phi$ assigning a connected subgraph of $G$ to each vertex of $H$, such that $\phi(u)$ and $\phi(v)$ are vertex-disjoint for all $u\neq v$, and if $uv\in E(H)$ then there exists an edge in $G$ with one endpoint in $\phi(u)$ and the other in $\phi(v)$.

In particular, any subgraph of $G$ is also a minor of $G$ (e.g. let $\phi$ be the identity).

Definition 2.3 (Treewidth).

A tree decomposition of a graph $G$ is a tree $T$ whose vertices are subsets of $V(G)$ (called “bags”), such that each vertex and edge of $G$ is contained in at least one of the bags, and for all $u\in V(G)$, the induced subgraph of $T$ on the bags that contain $u$ is a connected subtree of $T$. The width of $T$ is one less than the size of the largest bag, and the treewidth of $G$, denoted $\mathit{tw}(G)$, is the minimum width over all tree decompositions.

Roughly speaking, a graph has small treewidth if and only if it’s “similar to a tree”. See e.g. [Bod98, BK08] for further background, and [HW17] for a survey of parameters that are polynomially tied to treewidth.
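The conditions of Definition 2.3 can be checked mechanically for a candidate decomposition. The following sketch does so, taking the width to be one less than the size of the largest bag (the standard convention). The names `is_tree_decomposition` and `width` and the encoding of bags as a dict are assumptions of this example, and the code assumes without checking that `tree_edges` really forms a tree on the bags.

```python
def is_tree_decomposition(vertices, edges, bags, tree_edges):
    """bags: dict tree-node -> set of graph vertices; tree_edges: pairs of tree nodes.

    Assumes (does not check) that tree_edges forms a tree on the keys of bags.
    """
    bag_list = list(bags.values())
    # Every vertex of the graph lies in at least one bag.
    if not all(any(u in bag for bag in bag_list) for u in vertices):
        return False
    # Every edge of the graph lies inside at least one bag.
    if not all(any(u in bag and v in bag for bag in bag_list) for u, v in edges):
        return False
    neighbors = {t: set() for t in bags}
    for s, t in tree_edges:
        neighbors[s].add(t)
        neighbors[t].add(s)
    # For every vertex u, the bags containing u induce a connected subtree.
    for u in vertices:
        nodes = {t for t, bag in bags.items() if u in bag}
        start = next(iter(nodes))
        seen, stack = {start}, [start]
        while stack:
            t = stack.pop()
            for s in neighbors[t] & nodes:
                if s not in seen:
                    seen.add(s)
                    stack.append(s)
        if seen != nodes:
            return False
    return True


def width(bags):
    """One less than the size of the largest bag."""
    return max(len(bag) for bag in bags.values()) - 1


# The 4-cycle 0-1-2-3-0 with two bags joined by one tree edge.
cycle_vertices = [0, 1, 2, 3]
cycle_edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
good_bags = {"s": {0, 1, 2}, "t": {0, 2, 3}}
print(is_tree_decomposition(cycle_vertices, cycle_edges, good_bags, [("s", "t")]), width(good_bags))

# Inserting a middle bag that omits vertices 0 and 2 breaks the subtree condition.
bad_bags = {"s": {0, 1, 2}, "m": {1}, "t": {0, 2, 3}}
print(is_tree_decomposition(cycle_vertices, cycle_edges, bad_bags, [("s", "m"), ("m", "t")]))
```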

The edge expansion of a graph $G$ is defined as follows:

$$h(G)=\min_{\emptyset\subset U\subset V(G)}\frac{e(U,V(G)-U)}{\min(|U|,|V(G)-U|)}.$$

A bounded-degree expander is a graph with edge expansion $\Omega(1)$ and maximum degree $O(1)$ (see [HLW06] for a survey). Let $\lambda_{i}(G)$ be the $i$ ’th largest eigenvalue of the adjacency matrix of $G$ . We will use the following half of Cheeger’s Inequality:

Fact 2.4 ([AM85]).

If $G$ is a $d$ -regular graph then $h(G)\geq(d-\lambda_{2}(G))/2$ .
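For a concrete instance of Fact 2.4, consider the hypercube $Q_{3}$. The adjacency eigenvalues of $Q_{d}$ are $d-2i$ for $i=0,\dotsc,d$, so $\lambda_{2}(Q_{d})=d-2$ and the fact gives $h(Q_{d})\geq 1$. The sketch below computes $h(Q_{3})$ by exhaustive search over vertex subsets and compares it with this bound; `edge_expansion` is a hypothetical helper written for this example and is only feasible for very small graphs.

```python
from fractions import Fraction
from itertools import combinations, product


def edge_expansion(vertices, edges):
    """Minimum of e(U, V - U) / min(|U|, |V - U|) over nonempty proper subsets U.

    It suffices to try |U| <= |V| / 2, since U and its complement give the same ratio.
    """
    n = len(vertices)
    best = None
    for size in range(1, n // 2 + 1):
        for subset in combinations(vertices, size):
            U = set(subset)
            cut = sum(1 for u, v in edges if (u in U) != (v in U))
            ratio = Fraction(cut, size)
            if best is None or ratio < best:
                best = ratio
    return best


d = 3
cube_vertices = list(product((0, 1), repeat=d))
cube_edges = [(x, y) for x, y in combinations(cube_vertices, 2)
              if sum(a != b for a, b in zip(x, y)) == 1]

h = edge_expansion(cube_vertices, cube_edges)
cheeger_bound = Fraction(d - (d - 2), 2)  # (d - lambda_2) / 2 with lambda_2(Q_d) = d - 2
print(h, cheeger_bound)
```

For $Q_{3}$ the bound is attained: cutting the cube into two opposite faces removes 4 edges and separates 4 vertices from 4.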

Finally, let $\mathbf{ER}(n,p)$ be the Erdős-Rényi graph on $n$ vertices in which each possible edge exists independently with probability $p$.

3 The Average-Case Problem and the Parameter $\kappa(G)$

3.1 Threshold Random Graphs

First we will define threshold weightings, which assign weights to the vertices and edges of a graph subject to certain constraints. Then we will define a family of random graphs for each threshold weighting. The content in this subsection is essentially all from [LRR17].

Definition 3.1.

A threshold weighting on a graph $G$ is a pair $(\alpha,\beta)\in[0,1]^{V(G)}\times[0,2]^{E(G)}$ with the following property. For $H\subseteq G$ let $\alpha(H)=\sum_{u\in V(H)}\alpha(u)$ and $\beta(H)=\sum_{e\in E(H)}\beta(e)$ , and let $\Delta(H)=\alpha(H)-\beta(H)$ . Then, $\Delta(H)\geq 0$ for all $H\subseteq G$ , and $\Delta(G)=0$ . Let $\theta(G)$ be the set of threshold weightings on $G$ .

We will often denote $\Delta=(\alpha,\beta)$ in a slight abuse of notation. (Since $\Delta(u)=\alpha(u)$ if $u$ is a single vertex, the pair $(\alpha,\beta)$ is uniquely determined by $\Delta$ .) The requirement that $\alpha$ be nonnegative is redundant because it’s a special case of the requirement that $\Delta$ be nonnegative. The requirement that $\beta\leq 2$ is also redundant because for every edge $uv$ ,

$$0\leq\Delta(uv)=\alpha(u)+\alpha(v)-\beta(uv)\leq 2-\beta(uv).$$

It will sometimes be convenient to define $\beta(e)=0$ for $e\notin E(G)$ , e.g. for disjoint sets $S,T\subseteq V(G)$ let $\beta(S,T)=\sum_{u\in S,v\in T}\beta(uv)$ , and for vertex-disjoint $A,B\subseteq G$ let $\beta(A,B)=\beta(V(A),V(B))$ .

Example 3.2 (Markov Chains).

Let $M\in\mathbb{R}_{\geq 0}^{V(G)\times V(G)}$ be a column stochastic matrix (meaning each column sums to 1) such that if $M_{u,v}\neq 0$ then either $u=v$ or $uv\in E(G)$ . Let $\alpha(u)=1-M_{u,u}$ for all $u$ , and $\beta(uv)=M_{u,v}+M_{v,u}$ for all $u\neq v$ . Then for all $H\subseteq G$ ,

$$\Delta(H)=\sum_{\substack{v\in V(H)\\ uv\in E(G)-E(H)}}M_{u,v}\geq 0,$$

with equality if $H=G$ . In fact, we prove that every threshold weighting is equivalent to at least one Markov Chain (Appendix A).

The following threshold weighting will be especially important, and can be thought of as representing a uniform random walk on $G$ :

Definition 3.3.

If $G$ lacks isolated vertices then let $\Delta_{\mathrm{o}}=(1,\beta_{\mathrm{o}})\in\theta(G)$ be the threshold weighting generated in Example 3.2 when $M_{u,v}=\mathbbm{1}\!\{uv\in E(G)\}/\deg(v)$ . That is, $\Delta_{\mathrm{o}}=(\alpha,\beta)$ , where $\alpha(u)=1$ for all $u$ and $\beta(uv)=1/\deg(u)+1/\deg(v)$ for all $uv$ . If $G$ is $d$ -regular then this simplifies to $\Delta_{\mathrm{o}}=(1,\beta_{\mathrm{o}})=(1,2/d)$ .
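The claim that $\Delta_{\mathrm{o}}$ is a threshold weighting can be sanity-checked on small graphs by enumerating every subgraph. The sketch below does this with exact rational arithmetic; the helper names (`uniform_walk_weighting`, `delta`, `all_subgraphs`) are assumptions of this example, and the exhaustive enumeration is exponential, so it is meant for toy inputs only.

```python
from fractions import Fraction
from itertools import combinations


def uniform_walk_weighting(vertices, edges):
    """alpha(u) = 1 and beta(uv) = 1/deg(u) + 1/deg(v); requires no isolated vertices."""
    deg = {u: sum(1 for e in edges if u in e) for u in vertices}
    alpha = {u: Fraction(1) for u in vertices}
    beta = {(u, v): Fraction(1, deg[u]) + Fraction(1, deg[v]) for u, v in edges}
    return alpha, beta


def delta(alpha, beta, h_vertices, h_edges):
    """Delta(H) = alpha(H) - beta(H)."""
    return sum(alpha[u] for u in h_vertices) - sum(beta[e] for e in h_edges)


def all_subgraphs(vertices, edges):
    """Yield every subgraph as (vertex tuple, edge tuple)."""
    for r in range(len(vertices) + 1):
        for U in combinations(vertices, r):
            inside = [e for e in edges if e[0] in U and e[1] in U]
            for s in range(len(inside) + 1):
                for F in combinations(inside, s):
                    yield U, F


# A path on three vertices (not regular) and the complete graph K_4 (3-regular).
path_vertices, path_edges = [0, 1, 2], [(0, 1), (1, 2)]
k4_vertices = [0, 1, 2, 3]
k4_edges = list(combinations(k4_vertices, 2))

for vertices, edges in [(path_vertices, path_edges), (k4_vertices, k4_edges)]:
    alpha, beta = uniform_walk_weighting(vertices, edges)
    smallest = min(delta(alpha, beta, U, F) for U, F in all_subgraphs(vertices, edges))
    print(smallest, delta(alpha, beta, vertices, edges))
```

For $K_{4}$ every edge gets $\beta_{\mathrm{o}}=2/3$, so a single edge has $\Delta_{\mathrm{o}}=2-2/3=4/3$.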

Now we define threshold random graphs:

Definition 3.4.

For $\Delta=(\alpha,\beta)\in\theta(G)$ let $\mathbf{X}_{\Delta,n}$ be the graph with vertices $u_{i}$ for $u\in V(G)$ and $i\in[n^{\alpha(u)}]$ , and for $uv\in E(G)$ , each edge $u_{i}v_{j}$ independently with probability $n^{-\beta(uv)}$ . The graph $\mathbf{X}_{\Delta,n}$ comes with the coloring to $G$ defined by $u_{i}\mapsto u$ .

For $H\subseteq G$ and $X$ in the support of $\mathbf{X}_{\Delta,n}$ , let $\mathrm{Sub}_{X}(H)$ be the set of subgraphs $H^{\prime}\subseteq X$ such that the aforementioned coloring (restricted to $V(H^{\prime})$ ) is an isomorphism from $H^{\prime}$ to $H$ . We say that such a graph $H^{\prime}$ is “ $H$ -colored”. Note that $\mathrm{Sub}_{X}(H)$ can be identified with a subset of $\prod_{u\in V(H)}[n^{\alpha(u)}]$ .

Lemma 3.5.

If $\Delta\in\theta(G)$ and $H\subseteq G$ then $\mathbb{E}[|\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(H)|]=n^{\Delta(H)}(1\pm o(1))$ .

Proof.

Let $(\alpha,\beta)=\Delta$ . The set $\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(H)$ contains each of its $n^{\alpha(H)}$ possible elements with probability $n^{-\beta(H)}$ , so the result follows from linearity of expectation. (The $1\pm o(1)$ accounts for having to round $n^{\alpha(\cdot)}$ to an integer.) ∎

Lemma 3.5 motivates the requirements that $\Delta$ be nonnegative everywhere and that $\Delta(G)=0$ . Recall that the problem $\text{$ G $-$ \mathsf{SUB} $}(X)$ asks whether $\mathrm{Sub}_{X}(G)$ is the empty set. Since $\Delta(G)$ is required to be zero, it follows that $\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(G)$ has (approximately) one element on average, and the probability that $\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(G)$ is empty is known to be bounded away from 0 and 1 as $n$ goes to infinity [LRR17].
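To get a feel for Definition 3.4 and Lemma 3.5, one can sample $\mathbf{X}_{\Delta,n}$ for a small $G$ and count $H$-colored subgraphs directly. The following is a rough sketch under several assumptions of this example: the function names are made up, class sizes $n^{\alpha(u)}$ are rounded to the nearest integer, and edges of $X$ are stored with the same orientation $(u,v)$ as the corresponding edge of $G$.

```python
import random
from itertools import product


def sample_threshold_graph(g_vertices, g_edges, alpha, beta, n, rng):
    """Sample X_{Delta,n}: vertices (u, i) for i < n^alpha(u), edges with probability n^-beta(uv)."""
    size = {u: max(1, round(n ** float(alpha[u]))) for u in g_vertices}
    x_edges = set()
    for u, v in g_edges:
        p = n ** (-float(beta[(u, v)]))
        for i in range(size[u]):
            for j in range(size[v]):
                if rng.random() < p:
                    x_edges.add(((u, i), (v, j)))
    return size, x_edges


def count_colored_copies(h_vertices, h_edges, size, x_edges):
    """|Sub_X(H)|, by brute force over one index per vertex of H."""
    count = 0
    for idx in product(*(range(size[u]) for u in h_vertices)):
        pick = dict(zip(h_vertices, idx))
        if all(((u, pick[u]), (v, pick[v])) in x_edges for u, v in h_edges):
            count += 1
    return count


# Triangle with the uniform-walk weighting: alpha = 1, beta = 2/d = 1 on every edge.
tri_vertices = ["a", "b", "c"]
tri_edges = [("a", "b"), ("b", "c"), ("a", "c")]
alpha = {u: 1 for u in tri_vertices}
beta = {e: 1 for e in tri_edges}

rng = random.Random(0)
trials = 200
total = 0
for _ in range(trials):
    size, x_edges = sample_threshold_graph(tri_vertices, tri_edges, alpha, beta, 20, rng)
    total += count_colored_copies(tri_vertices, tri_edges, size, x_edges)
print(total / trials)  # empirical mean of |Sub_X(G)|; Lemma 3.5 predicts about n^0 = 1
```

The printed average fluctuates from run to run and seed to seed, so it is only a qualitative check of the lemma, not a proof of anything.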

3.2 The Parameter $\kappa(G)$ and an Algorithm for the Average Case

We now define $\kappa(G)$ :

Definition 3.6 ([LRR17]).

Let $G$ be a graph with no isolated vertices. Let $\mathrm{Seq}(G)$ be the set of union sequences, meaning sequences $(H_{1},\dotsc,H_{k})$ of distinct subgraphs of $G$ such that $H_{k}=G$ and each $H_{i}$ is either an edge or the union of two previous graphs in the sequence. For $\Delta\in\theta(G)$ let $\kappa_{\Delta}(G)=\min_{S\in\mathrm{Seq}(G)}\max_{H\in S}\Delta(H)$ . Finally, let $\kappa(G)=\max_{\Delta\in\theta(G)}\kappa_{\Delta}(G)$ .

To simplify the exposition, whenever we refer to $\kappa(G)$ , the graph $G$ is implicitly assumed to lack isolated vertices. Li et al. [LRR17] proved that for any fixed $G$ , $\mathrm{AC}^{0}$ circuits solving $\text{$ G $-$ \mathsf{SUB} $}(\mathbf{X}_{\Delta,n})$ a.a.s. require size at least $n^{\kappa_{\Delta}(G)-o(1)}$ and at most $n^{2\kappa_{\Delta}(G)+c}$ (where $c$ is an absolute constant). The results about average-case complexity described in Section 1 are with respect to a $\Delta$ such that $\kappa_{\Delta}(G)=\kappa(G)$ .
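Definition 3.6 can be made concrete on the 4-cycle $C_{4}$ with the weighting $\Delta_{\mathrm{o}}$ (here $\alpha=1$ and $\beta=1$ on every edge, since $C_{4}$ is 2-regular). The sketch below validates a union sequence and evaluates $\max_{H\in S}\Delta(H)$ along it, which upper-bounds $\kappa_{\Delta_{\mathrm{o}}}(C_{4})$. Representing a subgraph by its edge set, and the helper names, are assumptions of this example; this representation is adequate only because graphs in a union sequence have no isolated vertices.

```python
def is_union_sequence(seq, g_edges):
    """seq: list of subgraphs, each a frozenset of edges (an edge is a frozenset of 2 vertices)."""
    graph = frozenset(g_edges)
    # Subgraphs must be distinct and the last one must be G itself.
    if not seq or len(set(seq)) != len(seq) or seq[-1] != graph:
        return False
    for k, H in enumerate(seq):
        single_edge = len(H) == 1 and H <= graph
        union = any(seq[i] | seq[j] == H for i in range(k) for j in range(i + 1, k))
        if not (single_edge or union):
            return False
    return True


def delta(H, alpha, beta):
    """Delta(H) = alpha(V(H)) - beta(E(H)), where V(H) is the set of endpoints of E(H)."""
    vertices = set().union(*H)
    return sum(alpha[u] for u in vertices) - sum(beta[e] for e in H)


cycle_edges = [frozenset((i, (i + 1) % 4)) for i in range(4)]
alpha = {v: 1 for v in range(4)}
beta = {e: 1 for e in cycle_edges}

s0, s1, s2, s3 = (frozenset([e]) for e in cycle_edges)
# Grow a path one edge at a time, then close the cycle.
sequence = [s0, s1, s0 | s1, s2, s0 | s1 | s2, s3, frozenset(cycle_edges)]

print(is_union_sequence(sequence, cycle_edges))
print(max(delta(H, alpha, beta) for H in sequence))
```

Every union sequence contains at least one single edge, and a single edge of $C_{4}$ has $\Delta_{\mathrm{o}}=2-1=1$, so the value 1 achieved by this sequence is in fact $\kappa_{\Delta_{\mathrm{o}}}(C_{4})$.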

Theorem 3.7.

The problem $\text{$ G $-$ \mathsf{SUB} $}(\mathbf{X}_{\Delta,n})$ can be solved in time $\tilde{O}(n^{\kappa_{\Delta}(G)})\leq\tilde{O}(n^{\kappa(G)})$ a.a.s. for any fixed $G$ .

Proof.

First we prove a weaker upper bound of $\tilde{O}(n^{2\kappa_{\Delta}(G)})$ , in a manner analogous to the circuit from [LRR17], and then we describe a modification (on Turing machines) that removes the factor of 2 from the exponent. Later we will remove the factor of 2 in $\mathrm{AC}^{0}$ using a different approach, summarized at the beginning of Section 6.

Let $S$ be a union sequence such that $\kappa_{\Delta}(G)=\max_{H\in S}\Delta(H)$. For any $H\in S$, by Lemma 3.5 and Markov’s Inequality, $P\left(|\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(H)|>n^{\Delta(H)}\log n\right)\leq 1/\log n$. (We will obtain a tighter bound of $P(|\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(H)|>\tilde{O}(n^{\Delta(H)}))\leq n^{-\omega(1)}$ in Section 6.1.) By a union bound it follows that if $X\sim\mathbf{X}_{\Delta,n}$ then $\max_{H\in S}|\mathrm{Sub}_{X}(H)|\leq\tilde{O}(n^{\kappa_{\Delta}(G)})$ a.a.s. Assume this condition holds for $X$. For each successive $H$ in $S$, compute $\mathrm{Sub}_{X}(H)$ as follows. If $H$ is a single edge then this is trivial. Otherwise $H=A\cup B$ for some previous $A,B\in S$, in which case $\mathrm{Sub}_{X}(H)$ is the set of $\mathcal{A}\cup\mathcal{B}$ such that $\mathcal{A}\in\mathrm{Sub}_{X}(A)$, $\mathcal{B}\in\mathrm{Sub}_{X}(B)$ and the projections of $\mathcal{A}$ and $\mathcal{B}$ onto $[n]^{V(A\cap B)}$ are equal. Therefore $\mathrm{Sub}_{X}(H)$ can be computed by brute force in time $\tilde{O}(|\mathrm{Sub}_{X}(A)|\cdot|\mathrm{Sub}_{X}(B)|)\leq\tilde{O}(n^{2\kappa_{\Delta}(G)})$. Finally, check whether $\mathrm{Sub}_{X}(G)$ is empty.

We can save a quadratic factor by computing $\mathrm{Sub}_{X}(H)$ from $\mathrm{Sub}_{X}(A)$ and $\mathrm{Sub}_{X}(B)$ as follows. (This is a case of the sort-merge join algorithm for computing the natural join of two relations, as defined in database theory [SKS11].) Define a partial order on $[n]^{V(A)}\cup[n]^{V(B)}$ by projecting onto $[n]^{V(A\cap B)}$ and applying the lexicographic order on $[n]^{V(A\cap B)}$ . Sort $\mathrm{Sub}_{X}(A)$ and $\mathrm{Sub}_{X}(B)$ in nondecreasing order, and for convenience add the symbol $\perp$ to the end of both sorted lists. Let $\mathcal{A}$ and $\mathcal{B}$ be the first elements of $\mathrm{Sub}_{X}(A)$ and $\mathrm{Sub}_{X}(B)$ respectively, and initialize an empty accumulator (which will ultimately equal $\mathrm{Sub}_{X}(H)$ ). While $\mathcal{A}\neq\perp$ and $\mathcal{B}\neq\perp$ , do the following. If $\mathcal{A}<\mathcal{B}$ then let $\mathcal{A}$ be the next element of $\mathrm{Sub}_{X}(A)$ . If $\mathcal{B}<\mathcal{A}$ then let $\mathcal{B}$ be the next element of $\mathrm{Sub}_{X}(B)$ . Otherwise, let $\mathcal{B}^{\prime}=\mathcal{B}$ , and while $\mathcal{B}^{\prime}\neq\perp$ and the projections of $\mathcal{A}$ and $\mathcal{B}^{\prime}$ onto $[n]^{V(A\cap B)}$ are equal, add $\mathcal{A}\cup\mathcal{B}^{\prime}$ to the accumulator and let $\mathcal{B}^{\prime}$ be the next element of $\mathrm{Sub}_{X}(B)$ . Then (once the procedure involving $\mathcal{B}^{\prime}$ has finished) let $\mathcal{A}$ be the next element of $\mathrm{Sub}_{X}(A)$ .

Sorting $\mathrm{Sub}_{X}(A)$ and $\mathrm{Sub}_{X}(B)$ takes $\tilde{O}(|\mathrm{Sub}_{X}(A)|+|\mathrm{Sub}_{X}(B)|)$ comparisons, each of which takes $\tilde{O}(1)$ time, and then computing $\mathrm{Sub}_{X}(H)$ takes $\tilde{O}(|\mathrm{Sub}_{X}(A)|+|\mathrm{Sub}_{X}(B)|+|\mathrm{Sub}_{X}(H)|)\leq\tilde{O}(n^{\kappa_{\Delta}(G)})$ time. ∎
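The merge step described in the proof can be sketched as follows. This is an illustrative implementation under assumptions of this example: each element of $\mathrm{Sub}_{X}(A)$ is encoded as a dict from vertices of $A$ to indices in $[n]$, `shared` lists the vertices of $A\cap B$ in a fixed order, and running off the end of a list plays the role of the symbol $\perp$.

```python
def sort_merge_join(sub_a, sub_b, shared):
    """Compute Sub_X(A ∪ B) from Sub_X(A) and Sub_X(B) by sort-merge join.

    sub_a, sub_b: lists of dicts (vertex of G -> index in [n])
    shared:       list of the vertices in V(A ∩ B)
    """
    def key(row):
        # Projection onto [n]^{V(A ∩ B)}, compared lexicographically.
        return tuple(row[u] for u in shared)

    a_list = sorted(sub_a, key=key)
    b_list = sorted(sub_b, key=key)
    accumulator = []
    i = j = 0
    while i < len(a_list) and j < len(b_list):
        ka, kb = key(a_list[i]), key(b_list[j])
        if ka < kb:
            i += 1
        elif kb < ka:
            j += 1
        else:
            # Scan the run of B-elements whose projection matches the current A-element.
            jj = j
            while jj < len(b_list) and key(b_list[jj]) == ka:
                accumulator.append({**a_list[i], **b_list[jj]})
                jj += 1
            i += 1  # j stays put so the next A-element can reuse the same run
    return accumulator


sub_a = [{"u": 1, "v": 1}, {"u": 2, "v": 1}, {"u": 3, "v": 2}]
sub_b = [{"v": 1, "w": 5}, {"v": 1, "w": 6}, {"v": 3, "w": 7}]
joined = sort_merge_join(sub_a, sub_b, ["v"])
print(len(joined))
```

Apart from sorting, the work is proportional to the two input sizes plus the output size, which is where the bound $\tilde{O}(|\mathrm{Sub}_{X}(A)|+|\mathrm{Sub}_{X}(B)|+|\mathrm{Sub}_{X}(H)|)$ comes from.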

We will use the following graph-theoretic properties of $\kappa(G)$ :

Theorem 3.8 ([LRR17]).

Let $G$ be a graph with no isolated vertices.

(i) There exists $\Delta=(1,\beta)\in\theta(G)$ (meaning $\Delta(u)=1$ for all vertices $u$) such that $\kappa(G)=\kappa_{\Delta}(G)$.

(ii) $\kappa(G)\geq v(G)h(G)/(3\max_{u\in V(G)}\deg(u))$, where $h(G)$ is the edge expansion of $G$.

(iii) If $G$ is a minor of some graph $H$ then $\kappa(G)\leq\kappa(H)$.

(Specifically, Corollary 4.2, Theorem 4.9, and Theorem 5.1 of [LRR17] correspond to Items (i), (ii) and (iii) respectively.)

Corollary 3.9.

(i) If $G$ is a bounded-degree expander then $\kappa(G)$ is $\Omega(v(G))$.

(ii) If $G$ is a $d$-regular graph then $\kappa(G)\geq v(G)(1-\lambda_{2}(G)/d)/6$.

Proof of Corollary 3.9.

Item (i) follows from Theorem 3.8(ii), as observed by Li et al. [LRR17]. Item (ii) follows from Theorem 3.8(ii) and Fact 2.4. ∎
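As a worked instance of the bound for $d$-regular graphs, take the hypercube $Q_{d}$, which is $d$-regular on $2^{d}$ vertices with $\lambda_{2}(Q_{d})=d-2$ (its adjacency eigenvalues are $d-2i$). The bound then evaluates to $2^{d}\cdot(2/d)/6=2^{d}/(3d)$, the same order as the $\Theta(q^{d}/d)$ of Theorem 1.4 with $q=2$. The short sketch below just carries out this arithmetic exactly; the function name is an assumption of this example.

```python
from fractions import Fraction


def kappa_lower_bound_regular(num_vertices, d, lambda2):
    """The lower bound v(G) * (1 - lambda_2(G) / d) / 6 for a d-regular graph G."""
    return Fraction(num_vertices) * (1 - Fraction(lambda2, d)) / 6


for d in range(2, 8):
    bound = kappa_lower_bound_regular(2 ** d, d, d - 2)
    print(d, bound, bound == Fraction(2 ** d, 3 * d))
```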

4 The Parameter $\mathit{emb}(G)$ and Proof that $\mathit{emb}(G)$ is $O(\kappa(G))$

Recall that $\mathit{emb}(G)$ is significant because of its role in Marx’s ETH-hardness result for $G$ - $\mathsf{SUB}$ , namely Theorem 1.2.

Definition 4.1 ( $\mathit{emb}(G)$ ).

Let $G^{(q)}$ be the graph formed by replacing each vertex of $G$ with a $q$-clique, i.e. it has vertices $u_{i}$ for all $u\in V(G)$ and $i\in[q]$, and edges $u_{i}v_{j}$ for all $u_{i}\neq v_{j}$ such that either $u=v$ or $uv\in E(G)$. Let $\mathit{emb}(G)$ be the supremum of all $r>0$ for which there exists $m_{0}=m_{0}(G,r)$ such that if $H$ is any graph with $m\geq m_{0}$ edges and no isolated vertices, then $H$ is a minor of $G^{(\lceil m/r\rceil)}$, and furthermore a minor mapping from $H$ to $G^{(\lceil m/r\rceil)}$ can be computed in time $f(G)m^{O(1)}$ for some function $f$.

Although the requirement that such a minor mapping be efficiently computable is crucial in Theorem 1.2, none of the other results about $\mathit{emb}(G)$ that we reference or derive depend on this requirement, so we may safely ignore it going forward. The following example illustrates 4.1:

Example 4.2 ( $\mathit{emb}(K_{k})$ [Mar10]).

Since $K_{k}^{(\lceil m/r\rceil)}=K_{k\lceil m/r\rceil}$, any graph $H$ with $m$ edges is a minor of $K_{k}^{(\lceil m/r\rceil)}$ if and only if $v(H)\leq k\lceil m/r\rceil$. If $H$ has no isolated vertices then $H$ could have up to $2m$ vertices, so the requirement becomes $2m\leq k\lceil m/r\rceil$. Therefore $\mathit{emb}(K_{k})=k/2$: it is sufficient for $2m$ to be at most $km/r$ (i.e. $r\leq k/2$), and no $r>k/2$ satisfies $2m\leq k\lceil m/r\rceil$ for arbitrarily large $m$.
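As a concrete illustration of the blow-up operation in Definition 4.1, the following sketch builds $G^{(q)}$ from a vertex list and an edge list, and can be used to confirm the identity $K_{k}^{(q)}=K_{kq}$ used in Example 4.2 on small cases. The function names `blow_up` and `complete_graph`, and the encoding of $u_{i}$ as the pair `(u, i)`, are assumptions made for this example.

```python
from itertools import combinations

def blow_up(vertices, edges, q):
    """Return (vertices, edges) of G^(q): every vertex becomes a q-clique.

    Vertex u_i is encoded as the pair (u, i); edges are frozensets of pairs.
    """
    adjacent = {frozenset(e) for e in edges}
    new_vertices = [(u, i) for u in vertices for i in range(q)]
    new_edges = {
        frozenset((x, y))
        for x, y in combinations(new_vertices, 2)
        # u_i v_j is an edge when u = v (same clique) or uv is an edge of G.
        if x[0] == y[0] or frozenset((x[0], y[0])) in adjacent
    }
    return new_vertices, new_edges

def complete_graph(k):
    """Vertex and edge lists of K_k on vertices 0, ..., k-1."""
    vertices = list(range(k))
    return vertices, list(combinations(vertices, 2))
```

For instance, blowing up $K_{4}$ with $q=3$ yields 12 vertices and $\binom{12}{2}$ edges, i.e. $K_{12}$.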

*Remark.*

The name $\mathit{emb}(G)$ comes from the fact that Marx [Mar10] called a minor mapping from $H$ to $G^{(q)}$ an “embedding of depth $q$” from $H$ into $G$. Marx used the notation $G^{(q)}$, but the parameter $\mathit{emb}(G)$ is new in the current paper, all results about $\mathit{emb}(G)$ in [Mar10, AM11] having been stated in terms of embeddings of some depth.

The following is used in our proof that $\mathit{emb}(G)$ is $O(\kappa(G))$ :

Lemma 4.3.

$\kappa(G^{(q)})\leq q\max(\kappa(G),2)$.

Proof.

Let $\Delta=(\alpha,\beta)\in\theta(G^{(q)})$ be such that $\kappa(G^{(q)})=\kappa_{\Delta}(G^{(q)})$. Define a threshold weighting $\Delta^{\prime}=(\alpha^{\prime},\beta^{\prime})\in\theta(G)$ as follows: For all $u\in V(G)$ and $uv\in E(G)$,

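$$\alpha^{\prime}(u)=\frac{1}{q}\Bigl(\sum_{i\in[q]}\alpha(u_{i})-\sum_{i<j}\beta(u_{i}u_{j})\Bigr),\qquad\beta^{\prime}(uv)=\frac{1}{q}\sum_{i,j\in[q]}\beta(u_{i}v_{j}).$$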

This is a threshold weighting because if $H\subseteq G$ then $\Delta^{\prime}(H)=\Delta(H^{(q)})/q\geq 0$, with equality if $H=G$. It’s also normalized to $\alpha^{\prime}\leq 1$.

Let $S^{\prime}$ be an optimal union sequence for $G$ with respect to $\Delta^{\prime}$. Construct a union sequence $S$ for $G^{(q)}$ as follows:

1. For each $e\in E(G)$ append an arbitrary union sequence for $e^{(q)}$.

2. For each $H\in S^{\prime}$ (in order) append $H^{(q)}$.

If $H\subseteq e^{(q)}$ then $\Delta(H)\leq\alpha(e^{(q)})\leq 2q$, and we’ve already seen that $\Delta(H^{(q)})=q\Delta^{\prime}(H)$ for all $H\in S^{\prime}$. Therefore,

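$$\kappa(G^{(q)})=\kappa_{\Delta}(G^{(q)})\leq\max_{H\in S}\Delta(H)\leq\max\Bigl(2q,\;q\max_{H\in S^{\prime}}\Delta^{\prime}(H)\Bigr)=q\max(2,\kappa_{\Delta^{\prime}}(G))\leq q\max(2,\kappa(G)).$$ ∎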

Now we prove that $\mathit{emb}(G)$ is $O(\kappa(G))$ (Theorem 1.5), using an argument similar to the proof by Marx [Mar10] that $\mathit{emb}(G)$ is $O(\mathit{tw}(G))$ :

Proof.

Let $r>0$, and assume there exists an arbitrarily large 3-regular expander $H$ that’s a minor of $G^{(\lceil e(H)/r\rceil)}$. Then by Items 3.9(i), 3.8(iii) and 4.3,

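$$\Omega(e(H))\leq\kappa(H)\leq\kappa\bigl(G^{(\lceil e(H)/r\rceil)}\bigr)\leq\lceil e(H)/r\rceil\max(\kappa(G),2),$$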

so $r$ must be $O(\kappa(G))$ . ∎

Li et al. [LRR17] posed the question of whether Theorem 1.2 holds with $\kappa(G)$ in place of $\mathit{emb}(G)$ . By Theorem 1.5 this would be a stronger bound, which makes the question even more interesting. This problem is open even in the case of 3-regular expanders: recall from Section 1 that if $G$ is a 3-regular expander then $\mathit{emb}(G)$ is $\Theta(\mathit{tw}(G)/\log\mathit{tw}(G))$ and $\kappa(G)$ is $\Theta(\mathit{tw}(G))$ [AM11, LRR17].

The fact that $\kappa(G)$ is $\Omega(\mathit{emb}(G))$ gives an alternate proof, besides the one in [LRR17], that $\kappa(G)$ is $\Omega(\mathit{tw}(G)/\log\mathit{tw}(G))$ .

5 Separating $\kappa$ from Treewidth

In Section 5.1 we prove that $\kappa(K_{k})=k/4+O(1)$, which is a special case of the more general result that $\kappa(K_{q}^{d})=\Theta(q^{d}/d)$, and improves on the observation of Li et al. [LRR17] that $\kappa(K_{k})$ is $\Theta(k)$. In this case ($d=1$) we obtain tighter multiplicative constants, and the section provides an opportunity to illustrate the main ideas of our proof in a simpler setting, but it may be skipped without penalty. In Section 5.2 we prove that $\kappa(K_{q}^{d})$ is $O(q^{d}/d)$ when $q$ is even, which is sufficient to separate $\kappa$ from treewidth. Again, this case is cleaner than the general case and conveys most of the intuition behind it. In Appendix B we prove that $\kappa(K_{q}^{d})$ is $O(q^{d}/d)$ for all $q$. In Section 5.3 we prove that $\kappa(K_{q}^{d})$ is $\Omega(q^{d}/d)$ in two different ways, completing the proof that $\kappa(K_{q}^{d})$ is $\Theta(q^{d}/d)$ (Theorem 1.4), and we obtain as a corollary that $\mathit{emb}(K_{q}^{d})$ is $\Theta(q^{d}/d)$ as well. In Section 5.4 we summarize the proof of Chandran and Kavitha [CK06] that $\mathit{tw}(K_{q}^{d})$ is $\Theta(q^{d}/\sqrt{d})$.

5.1 Proof that $\kappa(K_{k})=k/4+O(1)$

Rossman [Ros08] proved that $\kappa_{\Delta_{\mathrm{o}}}(K_{k})\geq k/4$ (recall 3.3), so it suffices to prove the upper bound. By Item 3.8(i) it suffices to prove that $\kappa_{\Delta}(K_{k})\leq k/4+O(1)$ for an arbitrary $\Delta=(1,\beta)\in\theta(K_{k})$. First we construct, by downwards induction, a sequence $U_{1}\subseteq\dotsb\subseteq U_{k}=V(K_{k})$ such that $U_{i}$ is an $i$-element subset of $V(K_{k})$ and $\beta(K_{k}[U_{i}])\geq\beta_{\mathrm{o}}(K_{k}[U_{i}])$ for all $i$. The set $U_{k}=V(K_{k})$ satisfies this requirement because $\beta(K_{k})$ and $\beta_{\mathrm{o}}(K_{k})$ are both equal to $k$. Given $U_{i}$, let $\mathbf{U}_{i-1}$ be an $(i-1)$-element subset of $U_{i}$ chosen uniformly at random. Each pair of elements in $U_{i}$ is included in $\mathbf{U}_{i-1}$ with the same probability $p_{i}$ ($=1-2/i$), so by linearity of expectation,

[TABLE]

Therefore there exists a fixed $U_{i-1}$ such that $\beta(K_{k}[U_{i-1}])\geq\beta_{\mathrm{o}}(K_{k}[U_{i-1}])$ .
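The averaging step can be checked by direct enumeration on small instances. The sketch below is illustrative only: the function names and the dictionary encoding of an edge weighting $\beta$ (keys are `frozenset` pairs, values are exact fractions) are assumptions. It confirms that a fixed pair survives the removal of a uniformly random element with probability $1-2/i$, and that the mean weight of the resulting $(i-1)$-subsets is $p_{i}$ times the weight of the original set.

```python
from fractions import Fraction
from itertools import combinations

def pair_survival_probability(i):
    """P[a fixed pair of an i-set lies in a uniform (i-1)-subset], for i >= 2."""
    subsets = list(combinations(range(i), i - 1))
    hits = sum(1 for s in subsets if 0 in s and 1 in s)
    return Fraction(hits, len(subsets))

def edge_weight(beta, subset):
    """Total weight beta(K[subset]) of all pairs inside `subset`."""
    return sum(beta[frozenset(e)] for e in combinations(subset, 2))

def mean_weight_after_dropping_one(beta, subset):
    """Average of beta(K[S]) over all subsets S obtained by removing one element."""
    subsets = list(combinations(subset, len(subset) - 1))
    return sum(edge_weight(beta, s) for s in subsets) / len(subsets)
```

Since the same identity holds for $\beta_{\mathrm{o}}$, an inequality between the two weightings on $U_{i}$ is preserved in expectation, which is what yields a fixed good $U_{i-1}$.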

Construct a union sequence $S$ for $K_{k}$ as follows: start by enumerating the edges, and then for $i$ from 1 to $k-1$ , append $(K_{k}[U_{i}]\cup e_{1},K_{k}[U_{i}]\cup e_{1}\cup e_{2},\dotsc,K_{k}[U_{i+1}])$ , where $e_{1},e_{2},\dotsc$ are the edges between $U_{i}$ and $U_{i+1}-U_{i}$ . Then,

[TABLE]

Finally, as observed in [Ros08], since $K_{k}$ is $(k-1)$ -regular it follows from Eq. 1 that

[TABLE]

5.2 Proof that $\kappa(K_{q}^{d})$ is $O(q^{d}/d)$ if $q$ is Even

First we reduce to the case $q=2$. The graph $K_{q}^{d}$ is a subgraph of $Q_{d}^{((q/2)^{d})}$ (recall Definition 4.1), as evidenced by the following argument. Let $\phi_{L}:[q]\rightarrow\{0,1\}$ and $\phi_{R}:[q]\rightarrow[q/2]$ be such that $\phi_{L}\times\phi_{R}$ is a bijection from $[q]$ to $\{0,1\}\times[q/2]$, and let $\psi:[q/2]^{d}\rightarrow[(q/2)^{d}]$ be another arbitrary bijection. Then the following map is an injective homomorphism from $K_{q}^{d}$ to $Q_{d}^{((q/2)^{d})}$:

[TABLE]

By Items 3.8(iii) and 4.3, if $\kappa(Q_{d})$ is $O(2^{d}/d)$ then

[TABLE]

Now we prove that $\kappa(Q_{d})$ is $O(2^{d}/d)$ , following some brief definitions and a high-level overview of the argument. Fix $d$ . We identify each $u\in\{0,1\}^{d}$ with $\sum_{i=0}^{d-1}u_{i}2^{i}$ . For $0\leq a\leq 2^{d}$ let $G(a)=Q_{d}[0,\dotsc,a-1]$ . Recall that $\Delta_{\mathrm{o}}=(1,\beta_{\mathrm{o}})=(1,2/d)$ is a threshold weighting on $Q_{d}$ (3.3). Let $\mu=\max_{0\leq a\leq 2^{d}}\Delta_{\mathrm{o}}(G(a))$ .

*Remark.*

The intuition behind $\mu$ is as follows. The reader may note that $\kappa_{\Delta_{\mathrm{o}}}(Q_{d})\leq\mu+1$ , by reasoning analogous to that in Section 5.1. That is, for each vertex $u$ of $Q_{d}$ in increasing lexicographic order, add to an accumulator all edges $uv$ for which $v<u$ .

There is another union sequence captured by $\mu$ as well. If a subgraph $B\subseteq Q_{d}$ is isomorphic to $Q_{k}$ for some $k$ , then since $Q_{k}$ is isomorphic to $G(2^{k})$ (and $\beta_{\mathrm{o}}$ is uniform) it follows that $\Delta_{\mathrm{o}}(B)\leq\mu$ . Consider a depth- $d$ binary tree in which each node at depth $k$ is a subgraph of $Q_{d}$ isomorphic to $Q_{d-k}$ (in particular, the root is $Q_{d}$ and the leaves are vertices), and each interior node is the union of its two children along with some additional edges corresponding to a coordinate cut. This tree describes a union sequence $S$ for $Q_{d}$ : recursively obtain the graphs $L$ and $R$ corresponding to the children of $Q_{d}$ , and then take $L\cup R$ and add the missing edges. Note that $\max_{H\in S}\Delta_{\mathrm{o}}(H)=2\max_{0\leq k\leq d}\Delta_{\mathrm{o}}(G(2^{k}))\leq 2\mu$ .

Analogous to Section 5.1, the upper bound is obtained by comparing $\kappa_{\Delta}(Q_{d})$ to $\mu$ for each $\Delta$ , and bounding $\mu$ . For this purpose we will consider the two union sequences mentioned above, as well as hybrids of them.

The proof is as follows:

[TABLE]

For each threshold weighting $\Delta\in\theta(Q_{d})$ , it will be convenient in the following to generalize $\kappa_{\Delta}$ to subgraphs $H\subseteq Q_{d}$ by $\kappa_{\Delta}(H)=\min_{S\in\mathrm{Seq}(H)}\max_{F\in S}\Delta(F)$ . (This is a nontrivial generalization of the definition of $\kappa_{\Delta}$ , because if $\Delta(H)>0$ then the restriction of $\Delta$ to subgraphs of $H$ is not a threshold weighting on $H$ .) Also if $H$ is a single-vertex graph or the empty graph then let $\kappa_{\Delta}(H)=0$ .

Lemma 5.1.

Let $0\leq a\leq 2^{d}$ and $0\leq k\leq d$ be such that $2^{k}$ divides $a$ . Let $\Delta=(1,\beta)\in\theta(Q_{d})$ be such that $\beta(G(a))\geq\beta_{\mathrm{o}}(G(a))$ and $\beta(G(a+2^{k}))\geq\beta_{\mathrm{o}}(G(a+2^{k}))$ , and $\kappa_{\Delta}(G(a))\leq 2\mu$ . Then $\kappa_{\Delta}(G(a+2^{k}))\leq 2\mu$ .

Proof.

The proof is by induction on $k$ . The inductive hypothesis will actually be (slightly) stronger in the following way: given a labeling of the vertices of $Q_{d}$ with the elements of $\{0,1\}^{d}$ , the labels can be rearranged according to any of the $2^{d}d!$ isomorphisms of $Q_{d}$ , and the inductive hypothesis is required to hold with respect to any such labeling. (The value of $\mu$ doesn’t depend on the labeling used in its definition because of the symmetry of $\beta_{\mathrm{o}}$ .)

Let $B=G(a+2^{k})-G(a)$. Since $2^{k}$ divides $a$, it follows that $B$ is isomorphic to $Q_{k}$. In the inductive step we handle separately the cases where $\beta(B)\geq\beta_{\mathrm{o}}(B)$ and $\beta(B)<\beta_{\mathrm{o}}(B)$. The base case is a special case of the former because if $B$ is a single vertex then $\beta(B)$ and $\beta_{\mathrm{o}}(B)$ are both zero.

Case 1: $\beta(B)\geq\beta_{\mathrm{o}}(B)$ . If $k=0$ then $\kappa_{\Delta}(B)=0\leq 2\mu$ ; we now obtain the same result in the case where $k>0$ . For $0\leq i<k$ and $b\in\{0,1\}$ let $B(i,b)=B[v\in V(B)\mid v_{i}=b]$ . Choose $\mathbf{i}\in\{0,\dotsc,k-1\}$ and $\mathbf{b}\in\{0,1\}$ independently and uniformly at random. By symmetry, each $e\in E(B)$ is in $B(\mathbf{i},\mathbf{b})$ with the same probability $p$ . (Specifically, $p=(k-1)/2k$ : For any edge $uv$ , there is a unique index $i$ in which $u$ and $v$ differ. If $\mathbf{i}=i$ then exactly one of $u$ and $v$ is in $B(\mathbf{i},\mathbf{b})$ ; otherwise $uv$ is in $B(\mathbf{i},\mathbf{b})$ with probability 1/2 depending on $\mathbf{b}$ .) By linearity of expectation,

[TABLE]

Similarly, $\mathbb{E}[\beta_{\mathrm{o}}(B(\mathbf{i},\mathbf{b}))]=p\beta_{\mathrm{o}}(B)$ . By our assumption that $\beta(B)\geq\beta_{\mathrm{o}}(B)$ ,

[TABLE]

Therefore there exist fixed $i$ and $b$ such that $\beta(B(i,b))\geq\beta_{\mathrm{o}}(B(i,b))$ .
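The value $p=(k-1)/2k$ used in Case 1 can be confirmed by enumerating the edges of $Q_{k}$ for small $k$. This is a sketch under assumed names (`hypercube_edges`, `subcube_edge_probability`), with vertices encoded as bit tuples; it is not part of the paper's argument.

```python
from fractions import Fraction
from itertools import product

def hypercube_edges(k):
    """Edges of Q_k as pairs (u, v) of bit tuples differing in one coordinate."""
    edges = []
    for u in product((0, 1), repeat=k):
        for i in range(k):
            if u[i] == 0:  # list each edge once, from its 0-endpoint
                v = u[:i] + (1,) + u[i + 1:]
                edges.append((u, v))
    return edges

def subcube_edge_probability(k, edge):
    """P[edge lies in B(i, b)] for uniform i in {0..k-1} and b in {0, 1}.

    B(i, b) is the subcube of vertices whose i-th coordinate equals b.
    """
    u, v = edge
    hits = sum(1 for i in range(k) for b in (0, 1) if u[i] == b and v[i] == b)
    return Fraction(hits, 2 * k)
```

Every edge gets the same probability, which is the symmetry the proof relies on when comparing $\mathbb{E}[\beta(B(\mathbf{i},\mathbf{b}))]$ with $\mathbb{E}[\beta_{\mathrm{o}}(B(\mathbf{i},\mathbf{b}))]$.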

Now our claim that $\kappa_{\Delta}(B)\leq 2\mu$ follows from two applications of the inductive hypothesis. Since we required the inductive hypothesis to hold for all labelings of $Q_{d}$ , we can assume without loss of generality that $i=k-1$ and $b=0$ . Ignoring $G(a)$ , an application of the inductive hypothesis with $a^{\prime}=0$ and $k^{\prime}=k-1$ reveals that $\kappa_{\Delta}(B(i,b))\leq 2\mu$ , and then a second application of the inductive hypothesis with $a^{\prime\prime}=2^{k-1}$ and $k^{\prime\prime}=k-1$ reveals that $\kappa_{\Delta}(B)\leq 2\mu$ .

Let $S$ be an optimal (with respect to $\Delta$) union sequence for $G(a)$, followed by an optimal union sequence for $B$, followed by $G(a)\cup B,G(a)\cup B\cup e_{1},\dotsc,G(a)\cup B\cup\{e_{j}\}$, where the $\{e_{j}\}$ are the edges between $G(a)$ and $B$ in $Q_{d}$. (Footnote 3: It is also necessary to add each edge $e_{j}$ individually to the union sequence, but clearly $\Delta(e_{j})\leq 2\leq 2(2-2/d)=2\Delta_{\mathrm{o}}(G(2))\leq 2\mu$ if $d\geq 2$, and if $d=1$ then the lemma holds trivially.) (If $G(a)$ or $B$ lacks edges then omit certain graphs from this sequence.) Then,

[TABLE]

We proceed to bound each of these three terms by $2\mu$ , completing the proof. We have assumed that $\kappa_{\Delta}(G(a))\leq 2\mu$ , and proved that $\kappa_{\Delta}(B)\leq 2\mu$ . We have also assumed that $\beta(G(a))\geq\beta_{\mathrm{o}}(G(a))$ , and since $\Delta$ and $\Delta_{\mathrm{o}}$ both evaluate to 1 on all vertices, it follows that $\Delta(G(a))\leq\Delta_{\mathrm{o}}(G(a))\leq\mu$ (with the last step following from the definition of $\mu$ ). Similarly, since $B$ is isomorphic to $G(2^{k})$ it follows that $\Delta(B)\leq\Delta_{\mathrm{o}}(B)\leq\mu$ . Therefore $\Delta(G(a))+\Delta(B)\leq 2\mu$ .

Case 2: $\beta(B)<\beta_{\mathrm{o}}(B)$. For $i<k$ and $b\in\{0,1\}$ let $H(i,b)=Q_{d}[0,\dotsc,a-1,V(B(i,b))]$ (where $B(i,b)$ is defined as above). Choose $\mathbf{i}<k$ and $\mathbf{b}\in\{0,1\}$ independently and uniformly at random. Note that $\beta(G(a+2^{k}))=\beta(G(a))+\beta(G(a),B)+\beta(B)$. (Footnote 4: Recall from 3.1 that $\beta(A,B)\coloneqq\sum_{u\in V(A),v\in V(B)}\beta(uv)$.) By reasoning similar to that in the previous case (and applying our various assumptions),

[TABLE]

Therefore $\beta(H(i,b))>\beta_{\mathrm{o}}(H(i,b))$ for some fixed $i$ and $b$ .

Assume without loss of generality that $i=k-1$ and $b=0$ ; then $H(i,b)=G(a+2^{k-1})$ . Applying the inductive hypothesis with $a^{\prime}=a$ and $k^{\prime}=k-1$ reveals that $\kappa_{\Delta}(G(a+2^{k-1}))\leq 2\mu$ , and then applying the inductive hypothesis with $a^{\prime\prime}=a+2^{k-1}$ and $k^{\prime\prime}=k-1$ reveals that $\kappa_{\Delta}(G(a+2^{k}))\leq 2\mu$ . ∎

Lemma 5.2.

$\mu<2/3\cdot 2^{d}/d$ .

Proof.

For any $0\leq a\leq 2^{d}$ , it follows from Eq. 1 that $\Delta_{\mathrm{o}}(G(a))=e(G(a),Q_{d}-G(a))/d$ , so it suffices to prove that $e(G(a),Q_{d}-G(a))<2^{d+1}/3$ for all $a$ . Let $G(a,b)=Q_{d}[a,\dotsc,b-1]$ . Since

[TABLE]

(as can be seen by applying the automorphism $(x_{1},\dotsc,x_{d})\mapsto(1-x_{1},\dotsc,1-x_{d})$ to $Q_{d}$ ), we can restrict our search to $a\in[0,2^{d-1}]$ . In that case,

[TABLE]

By the same reasoning,

[TABLE]

Since $e(G(0,a),G(a,2^{d-1}))=e(G(0,2^{d-1}-a),G(2^{d-1}-a,2^{d-1}))$ (consider a similar automorphism), it follows that if $a<2^{d-2}$ then

[TABLE]

Therefore we can restrict our search to $a\in[2^{d-2},2^{d-1}]$ , in which case

[TABLE]

By induction it follows that $d\mu=\max_{a}e(G(a),Q_{d}-G(a))=2^{d-1}+2^{d-3}+2^{d-5}+\dotsb+\text{(2 or 1)}<2^{d+1}/3$, i.e. $\mu<2/3\cdot 2^{d}/d$. ∎
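Lemma 5.2 is easy to sanity-check by brute force for small $d$. The sketch below uses the identity $\Delta_{\mathrm{o}}(G(a))=e(G(a),Q_{d}-G(a))/d$ from the start of the proof, so it works with the integer $\max_{a}e(G(a),Q_{d}-G(a))=d\mu$. The function names are assumptions made for the example; vertices are the integers $0,\dotsc,2^{d}-1$ under the identification $u\mapsto\sum_{i}u_{i}2^{i}$, so flipping coordinate $i$ is an XOR with `1 << i`.

```python
def boundary_edges(d, a):
    """Number of edges of Q_d between {0, ..., a-1} and its complement."""
    return sum(
        1
        for u in range(a)
        for i in range(d)
        if (u ^ (1 << i)) >= a  # neighbour across coordinate i lies outside G(a)
    )

def d_times_mu(d):
    """The integer d * mu, i.e. the maximum of e(G(a), Q_d - G(a)) over 0 <= a <= 2^d."""
    return max(boundary_edges(d, a) for a in range(2 ** d + 1))
```

The lemma's bound $\mu<2/3\cdot 2^{d}/d$ is then the integer inequality `3 * d_times_mu(d) < 2 ** (d + 1)`.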

*Remark.*

Harper [Har04] proved that out of all subgraphs of $Q_{d}$ with $a$ vertices, $G(a)=G(0,a)$ has the fewest outgoing edges [Fil15].

5.3 Proof that $\kappa(K_{q}^{d})$ is $\Omega(q^{d}/d)$ and $\mathit{emb}(K_{q}^{d})$ is $\Theta(q^{d}/d)$

Alon and Marx [AM11, Theorem 4.3] proved that $\mathit{emb}(K_{q}^{d})$ is $\Omega(q^{d}/d)$, and it follows from Theorem 1.5 that $\mathit{emb}(K_{q}^{d})\leq O(\kappa(K_{q}^{d}))\leq O(q^{d}/d)$. Therefore $\mathit{emb}(K_{q}^{d})$ is $\Theta(q^{d}/d)$.

It is implicit in the above argument that $\kappa(K_{q}^{d})\geq\Omega(\mathit{emb}(K_{q}^{d}))\geq\Omega(q^{d}/d)$; we now present an alternate proof that $\kappa(K_{q}^{d})$ is $\Omega(q^{d}/d)$ based on edge expansion. Since $K_{q}^{d}$ is $d(q-1)$-regular, by Item 3.9(ii) it suffices to prove that $1-\lambda_{2}(K_{q}^{d})/(d(q-1))$ is $\Omega(1/d)$. We use the following well-known fact, where graphs are identified with their adjacency matrices:

Fact 5.3.

The eigenvalues of $G\mathbin{\square}H$ are $\lambda_{i}(G)+\lambda_{j}(H)$ for $i\in[v(G)],j\in[v(H)]$ .

Proof.

Observe that $G\mathbin{\square}H=G\otimes I+I\otimes H$ , where the symbols $\otimes$ and $I$ denote the tensor product and the identity matrix respectively. Let $u_{i}$ (resp. $w_{i}$ ) be the $i$ ’th eigenvector of $G$ (resp. $H$ ); clearly $u_{i}\otimes w_{j}$ is an eigenvector of $G\mathbin{\square}H$ with eigenvalue $\lambda_{i}(G)+\lambda_{j}(H)$ . Since a real symmetric matrix (in particular $G$ or $H$ ) has an orthogonal eigenbasis, it follows that the $u_{i}\otimes w_{j}$ are also orthogonal. Since $v(G\mathbin{\square}H)=v(G)v(H)$ , there are no other eigenvalues of $G\mathbin{\square}H$ . ∎

Since $\lambda_{i}(K_{q})$ equals $q-1$ if $i=1$ and $-1$ otherwise, repeated application of Fact 5.3 reveals that $\lambda_{2}(K_{q}^{d})=(q-1)(d-1)-1=d(q-1)-q$, so $1-\lambda_{2}(K_{q}^{d})/(d(q-1))=q/((q-1)d)$, as desired.
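The eigenvalue computation can be replayed mechanically: by Fact 5.3, the spectrum of $K_{q}^{d}$ consists of all sums of $d$ eigenvalues of $K_{q}$. The following sketch (function names are assumptions for the example) enumerates these sums and reads off $\lambda_{2}$ and the normalized gap exactly.

```python
from fractions import Fraction
from itertools import product

def complete_graph_spectrum(q):
    """Adjacency eigenvalues of K_q: q-1 once, and -1 with multiplicity q-1."""
    return [q - 1] + [-1] * (q - 1)

def cartesian_power_spectrum(q, d):
    """Eigenvalues of K_q^d (d-fold Cartesian power), sorted in decreasing order."""
    base = complete_graph_spectrum(q)
    return sorted((sum(choice) for choice in product(base, repeat=d)), reverse=True)

def normalized_gap(q, d):
    """The quantity 1 - lambda_2 / degree, as an exact fraction."""
    spectrum = cartesian_power_spectrum(q, d)
    degree = spectrum[0]  # the top eigenvalue of a regular graph is its degree
    return 1 - Fraction(spectrum[1], degree)
```

For example, `normalized_gap(2, d)` gives the hypercube value $2/d$.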

*Remark.*

Fact 2.4 is an equality in the case of hypercubes (see e.g. [HLW06]): let $i\in[d]$ and define a cut by partitioning the vertices according to the values of their $i$’th coordinates. So for hypercubes, all slack in the application of Item 3.9(ii) comes from Item 3.8(ii).

5.4 Proof that $\mathit{tw}(K_{q}^{d})$ is $\Theta(q^{d}/\sqrt{d})$, Summarized

(See [CK06] for the full proof.) The proof that $\mathit{tw}(K_{q}^{d})$ is $O(q^{d}/\sqrt{d})$ reduces to the case $q=2$ by reasoning analogous to that in the beginning of Section 5.2. For $k\in[d]$ let $U_{k}$ be the set of vertices of $Q_{d}$ with exactly $k$ or $k-1$ ones. The path $(U_{1},\dotsc,U_{d})$ is a tree decomposition of $Q_{d}$ with width approximately $2\binom{d}{d/2}$, and by Stirling’s approximation this is $\Theta(2^{d}/\sqrt{d})$. (Footnote 5: Compared to the tree decomposition from [CK06], this one is a simpler variant whose width is larger by up to a constant factor.)

For a graph $G$ let $\phi(G)$ be the minimum, over all $U\subseteq V(G)$ with $v(G)/4\leq|U|\leq v(G)/2$, of the number of vertices in $V(G)-U$ with at least one neighbor in $U$. From a result of Robertson and Seymour [RS86] it follows that $\mathit{tw}(G)\geq\phi(G)-1$, and from a result of Harper [Har99] it follows that $\phi(K_{q}^{d})$ is $\Omega(q^{d}/\sqrt{d})$. (Also note the parallels between $\mathit{tw}(G)\geq\phi(G)-1$ and Item 3.8(ii); interestingly, we’ve seen that both are tight to within a constant factor in the case of $K_{q}^{d}$.)
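The width of the layered path decomposition described above can be computed exactly, which makes the $\Theta(2^{d}/\sqrt{d})$ growth visible for small $d$. This is a sketch with assumed function names; it only counts bag sizes and does not construct the decomposition itself. Note that $|U_{k}|=\binom{d}{k}+\binom{d}{k-1}=\binom{d+1}{k}$ by Pascal's rule.

```python
from math import comb, sqrt

def layer_bag_sizes(d):
    """|U_k| for k = 1..d, where U_k holds the vertices of Q_d with k or k-1 ones."""
    return [comb(d, k) + comb(d, k - 1) for k in range(1, d + 1)]

def layered_width(d):
    """Width of the path decomposition (U_1, ..., U_d): largest bag size minus one."""
    return max(layer_bag_sizes(d)) - 1

def width_ratio(d):
    """Width divided by 2^d / sqrt(d); stays bounded if the width is Theta(2^d / sqrt(d))."""
    return layered_width(d) * sqrt(d) / 2 ** d
```

Each edge of $Q_{d}$ joins a vertex with $k-1$ ones to one with $k$ ones and so lies inside $U_{k}$, and each vertex appears in at most two consecutive bags, which is why the sequence is a valid path decomposition.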

6 $\mathrm{AC}^{0}$ Upper Bound

An $\mathrm{AC}^{0}$ circuit is a constant-depth circuit with unbounded-fanin AND and OR gates and NOT gates. Fix a graph $G$ and threshold weighting $\Delta\in\theta(G)$ for the remainder of this section. We prove the following, which is a more precise statement of Theorem 1.3:

Theorem 6.1.

There exists an $\mathrm{AC}^{0}$ circuit with $n^{\kappa_{\Delta}(G)+c}$ wires that solves $G$-$\mathsf{SUB}(\mathbf{X}_{\Delta,n})$ with probability $1-n^{-\omega(1)}$, where $c>0$ is an absolute constant.

Since in any circuit the number of gates is at most one plus the number of wires, the circuit from Theorem 6.1 has size $n^{\kappa_{\Delta}(G)+O(1)}\leq n^{\kappa(G)+O(1)}$. (In this discussion, all $\pm O(1)$ terms in an exponent are independent of $G$.) For comparison, it was proved in [LRR17] (building on a line of previous work [Ros08, Ama10, Ros10, NW11]) that the average-case $\mathrm{AC}^{0}$ complexity of $G$-$\mathsf{SUB}(\mathbf{X}_{\Delta,n})$ is between $n^{\kappa_{\Delta}(G)-o(1)}$ and $n^{2\kappa_{\Delta}(G)+O(1)}$. Another related result, regarding the uncolored $k$-clique problem, is that the average-case $\mathrm{AC}^{0}$ complexity of $K_{k}$-$\mathsf{SUB}_{\mathrm{uncol}}(\mathbf{ER}(n,n^{-2/(k-1)}))$ is at most $n^{k/4+O(1)}$ [Ama10, Ros14] ($=n^{\kappa(K_{k})\pm O(1)}$ by Section 5.1). See [Ros18] for a survey of the average-case circuit complexity of subgraph isomorphism more generally.

One challenge to implementing the algorithm behind Theorem 3.7 in $\mathrm{AC}^{0}$ is that sorting cannot be done in (polynomial-size) $\mathrm{AC}^{0}$ [Hås86]. The $n^{2\kappa_{\Delta}(G)+O(1)}$ -size circuit from [LRR17] computes $\mathrm{Sub}_{X}(A\cup B)$ by finding the relevant pairs in $\mathrm{Sub}_{X}(A)\times\mathrm{Sub}_{X}(B)$ by brute force with $\tilde{O}(|\mathrm{Sub}_{X}(A)|\cdot|\mathrm{Sub}_{X}(B)|)$ gates. Our circuit differs in that we represent $\mathrm{Sub}_{X}(H)$ as a depth- $v(H)$ tree, where the non-root vertices are assigned labels in $[n]$ , and the (sequences of labels along the) paths from the root to the leaves correspond to the elements of $\mathrm{Sub}_{X}(H)$ . This will allow us to compute $\mathrm{Sub}_{X}(A\cup B)$ with high probability given $\mathrm{Sub}_{X}(A)$ and $\mathrm{Sub}_{X}(B)$ , on a circuit of size nearly linear in $|\mathrm{Sub}_{X}(A)|+|\mathrm{Sub}_{X}(B)|$ . A key fact in our construction is that $\mathrm{AC}^{0}$ circuits can (with high probability) convert between representations of $\mathrm{Sub}_{X}(H)$ corresponding to different orderings of $V(H)$ .

Our construction requires fairly precise estimates for how many children to assign each node. Luckily this number is highly concentrated around its mean if the input graph is $\mathbf{X}_{\Delta,n}$ . This result will follow from the concentration inequality below, whose statement requires several definitions:

Definition 6.2.

Let $X$ be in the support of $\mathbf{X}_{\Delta,n}$ , and let $U\subseteq G$ be an arbitrary graph (which we think of as a “universe”). Let $\mathrm{Sub}_{n}(U)$ be the set of all possible elements of $\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(U)$ ; note that this can be identified with $\prod_{v\in V(U)}[n^{\alpha(v)}]$ . If $A\subseteq U$ and $\mathcal{A}\in\mathrm{Sub}_{n}(A)$ then let $\mathcal{A}$ extend to $U$ in $X$ if there exists a graph $\mathcal{U}\in\mathrm{Sub}_{X}(U)$ (called a $U$ -extension of $\mathcal{A}$ ) such that $\mathcal{A}\subseteq\mathcal{U}$ . (In context, $X$ or $\mathbf{X}$ will be implicit.) Equivalently, $\mathcal{A}$ could be required to be in $\mathrm{Sub}_{X}(A)$ rather than $\mathrm{Sub}_{n}(A)$ in the latter definition.

Let $\Delta^{*}_{U}(A)=\min_{A\subseteq H\subseteq U}\Delta(H)$. Let $X$ be good if for all graphs $U\subseteq G$ and $A\subseteq U$, and for all $\mathcal{A}\in\mathrm{Sub}_{n}(A)$ and vertices $v\in V(U)-V(A)$, there are $\tilde{O}(n^{\Delta^{*}_{U}(A\cup v)-\Delta^{*}_{U}(A)})$ values of $i\in[n^{\alpha(v)}]$ such that $\mathcal{A}\cup v_{i}$ extends to $U$. (Recall our unconventional definition of $\tilde{O}(\cdot)$ from Section 2, e.g. $\tilde{O}(1)$ denotes $\log^{O(1)}n$.) Finally, let an event occur with high probability (w.h.p.) if it occurs with probability $1-n^{-\omega(1)}$.

We prove the following:

Theorem 6.3.

The graph $\mathbf{X}_{\Delta,n}$ is good w.h.p.

Observe that this is a substantially stronger concentration bound than the application of Markov’s Inequality in the proof of Theorem 3.7. In Section 6.1 we prove Theorem 6.3, and then in Section 6.2 we use this result to prove Theorem 6.1. Both proofs use the following concentration inequality, which is proved by a Chernoff bound:

Lemma 6.4.

If $\mathbf{S}=\mathbf{S}(n)$ is a sum of independent Bernoulli random variables then w.h.p. $\mathbf{S}\leq\max(\mathbb{E}[\mathbf{S}],1)\cdot\tilde{O}(1)$ .

Proof.

Let $\mathbf{S}=\sum_{i}\mathbf{B}_{i}$ be a decomposition of $\mathbf{S}$ as a sum of independent Bernoulli random variables. Let $p_{i}=\mathbb{E}[\mathbf{B}_{i}]$ and $\mu=\mathbb{E}[\mathbf{S}]=\sum_{i}p_{i}$ . Then for $r,t\geq 0$ ,

[TABLE]

Letting $t=\log(r/\mu)$ gives $P(\mathbf{S}\geq r)\leq(e\mu/r)^{r}$ assuming $t\geq 0$ , and then (for example) letting $r=\max(\mu,1)\log^{2}n$ gives, for sufficiently large $n$ ,

[TABLE]
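The tail bound $P(\mathbf{S}\geq r)\leq(e\mu/r)^{r}$ quoted in the proof can be compared numerically against an exact tail in the special case where all the Bernoulli variables share one parameter $p$, so that $\mathbf{S}$ is binomial. The sketch below is only a sanity check under that simplifying assumption; the function names and the choice of test parameters are illustrative.

```python
from math import comb, e

def binomial_upper_tail(n, p, r):
    """Exact P[S >= r] for S ~ Binomial(n, p) and an integer threshold r."""
    return sum(comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(r, n + 1))

def chernoff_bound(mu, r):
    """The bound (e * mu / r)^r, valid for r >= mu (the case t = log(r / mu) >= 0)."""
    return (e * mu / r) ** r
```

With $n=200$ and $p=0.01$ (so $\mu=2$), the bound is far from tight for small $r$ but decays super-exponentially, which is what drives the $n^{-\omega(1)}$ failure probability once $r=\max(\mu,1)\log^{2}n$.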

6.1 Proof of Theorem 6.3

First we derive some algebraic properties of the threshold weighting $\Delta$ .

Lemma 6.5.

If $A,B\subseteq G$ then $\Delta(A)+\Delta(B)=\Delta(A\cap B)+\Delta(A\cup B)$ .

Proof.

Each vertex or edge in one (resp. two) of $A$ and $B$ is also in one (resp. two) of $A\cap B$ and $A\cup B$ . ∎
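Lemma 6.5 is the statement that $\Delta$ is modular on the lattice of subgraphs, and it can be checked directly on examples. In the sketch below a subgraph is a pair `(vertex_set, edge_set)` with edges stored as `frozenset` pairs, and $\Delta(H)=\alpha(H)-\beta(H)$ sums $\alpha$ over vertices and subtracts $\beta$ over edges; this encoding and the function names are assumptions made for the example.

```python
def delta(alpha, beta, subgraph):
    """Delta(H) = sum of alpha over V(H) minus sum of beta over E(H)."""
    vertices, edges = subgraph
    return sum(alpha[v] for v in vertices) - sum(beta[e] for e in edges)

def union(a, b):
    """Union of two subgraphs: union of vertex sets and of edge sets."""
    return (a[0] | b[0], a[1] | b[1])

def intersection(a, b):
    """Intersection of two subgraphs: common vertices and common edges."""
    return (a[0] & b[0], a[1] & b[1])

def is_modular_on(alpha, beta, a, b):
    """Check Delta(A) + Delta(B) == Delta(A ∩ B) + Delta(A ∪ B)."""
    lhs = delta(alpha, beta, a) + delta(alpha, beta, b)
    rhs = delta(alpha, beta, intersection(a, b)) + delta(alpha, beta, union(a, b))
    return lhs == rhs
```

The identity holds for any weights, including negative ones, because each vertex and edge is counted the same number of times on both sides.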

Definition 6.6.

For $A\subseteq U\subseteq G$ let $\Gamma_{U}(A)=\bigcap\{H\in[A,U]\mid\Delta(H)=\Delta^{*}_{U}(A)\}$ , and let $A$ be a $U$ -base if $\Delta(A)=\Delta^{*}_{U}(A)$ .

Throughout this subsection, $U$ will be an arbitrary subgraph of $G$ unless additional structure is imposed on it, and missing subscripts on $\Delta^{*}$ and $\Gamma$ default to $U$ .

Lemma 6.7.

If $A\subseteq U$ then $\Delta(\Gamma(A))=\Delta^{*}(A)$ and $A\subseteq\Gamma(A)$ .

Proof.

It suffices to show that the set $S=\{H\in[A,U]\mid\Delta(H)=\Delta^{*}(A)\}$ is closed under intersection. Let $B,C\in S$ . By the definition of $S$ , Lemma 6.5, and the fact that $A\subseteq B\cup C$ ,

[TABLE]

so $\Delta(B\cap C)\leq\Delta^{*}(A)$ . On the other hand, $\Delta(B\cap C)\geq\Delta^{*}(A)$ because $A\subseteq B\cap C$ . Therefore $\Delta(B\cap C)=\Delta^{*}(A)$ , so $B\cap C\in S$ . ∎

Lemma 6.8.

If $A\subseteq\Gamma(A)\subseteq U^{\prime}\subseteq U$ then $\Gamma(A)$ is a $U^{\prime}$ -base.

Proof.

Since the interval $[A,U]$ includes the interval $[\Gamma(A),U^{\prime}]$ , it follows from Lemma 6.7 that $\Delta(\Gamma(A))=\Delta^{*}_{U}(A)\leq\Delta^{*}_{U^{\prime}}(\Gamma(A))\leq\Delta(\Gamma(A))$ . Therefore $\Delta(\Gamma(A))=\Delta^{*}_{U^{\prime}}(\Gamma(A))$ . ∎

Lemma 6.9.

If $A\subseteq B\subseteq U$ then $\Gamma(A)\subseteq\Gamma(B)$ .

Proof.

Since $B\subseteq\Gamma(B)\subseteq\Gamma(B)\cup\Gamma(A)$ it follows that $\Delta^{*}(B)\leq\Delta(\Gamma(B)\cup\Gamma(A))$ , so by Lemmas 6.7 and 6.5,

[TABLE]

Therefore $\Delta^{*}(A)\geq\Delta(\Gamma(A)\cap\Gamma(B))$ . On the other hand, since $A\subseteq\Gamma(A)$ and $A\subseteq B\subseteq\Gamma(B)$ it follows that $A\subseteq\Gamma(A)\cap\Gamma(B)$ , so $\Delta^{*}(A)\leq\Delta(\Gamma(A)\cap\Gamma(B))$ . Therefore $\Delta^{*}(A)=\Delta(\Gamma(A)\cap\Gamma(B))$ , so it follows from the definition of $\Gamma(A)$ that $\Gamma(A)\subseteq\Gamma(A)\cap\Gamma(B)\subseteq\Gamma(B)$ . ∎

We now analyze the concentration of $\mathbf{X}_{\Delta,n}$ , making liberal use of the fact that if $n^{O(1)}$ events occur with uniformly high probability then their conjunction also occurs w.h.p. by a union bound. For the rest of this subsection, “extensions” are with respect to an implicit $\mathbf{X}\equiv\mathbf{X}_{\Delta,n}$ .

Lemma 6.10.

If $A\subseteq U$ and $\Gamma_{U}(A)=U$ (i.e. $\Delta(H)>\Delta(U)$ for all $H\in[A,U)$ ) then the number of $U$ -extensions of any $\mathcal{A}\in\mathrm{Sub}_{n}(A)$ is $\tilde{O}(1)$ w.h.p.

(The above conditions are equivalent because, by the definition of $\Gamma(A)$ , we have $\Gamma(A)=U$ if and only if $U$ is the unique $H\in[A,U]$ that minimizes $\Delta(H)$ .)

Proof.

The result is trivial for $A=U$ ; assume it’s true for all $B\in(A,U]$ and that $A\neq U$ . (Since $\Delta(H)>\Delta(U)$ for all $H\in[A,U)$ , for any $B\in(A,U]$ it is the case that $\Delta(H)>\Delta(U)$ for all $H\in[B,U)$ .) Assume without loss of generality that $A=U[V(A)]$ , since all $U$ -extensions of $\mathcal{A}$ are also $U$ -extensions of $\mathcal{A}$ ’s unique possible $U[V(A)]$ -extension. Also condition on $\mathcal{A}\subseteq\mathbf{X}$ , since otherwise $\mathcal{A}$ trivially has zero $U$ -extensions.

There are $n^{\alpha(U)-\alpha(A)}$ possible $U$ -extensions of $\mathcal{A}$ , so there are at most $n^{(\alpha(U)-\alpha(A))\log n}$ sets of $\log n$ possible $U$ -extensions of $\mathcal{A}$ whose projections onto $\mathrm{Sub}_{n}(U-A)$ are pairwise vertex-disjoint. (This is true even if we omit the condition about vertex-disjointness.) For each of these sets, all of its elements are subgraphs of $\mathbf{X}$ with probability $n^{(-\beta(U)+\beta(A))\log n}$ , so this occurs for at least one such set with probability at most $n^{(\Delta(U)-\Delta(A))\log n}$ (by a union bound). By assumption, $\Delta(U)-\Delta(A)<0$ , so w.h.p. any set of $U$ -extensions of $\mathcal{A}$ whose projections onto $\mathrm{Sub}_{n}(U-A)$ are pairwise vertex-disjoint has $\tilde{O}(1)$ elements.

Let $S$ be one such set, such that $S$ is maximal. It follows that every $U$ -extension of $\mathcal{A}$ agrees with some element of $S$ on some vertex in $V(U)-V(A)$ . Therefore $\mathcal{A}$ has at most $\sum_{\mathcal{U}\in S}\sum_{H\in(A,U]}E(\mathcal{U},H)$ $U$ -extensions, where $E(\mathcal{U},H)$ is the number of $U$ -extensions of $\mathcal{A}$ that agree with $\mathcal{U}$ on precisely $H$ . By the inductive hypothesis, $E(\mathcal{U},H)$ is $\tilde{O}(1)$ w.h.p. for all $\mathcal{U}$ and $H$ (independent of $S$ ), so $\mathcal{A}$ has $\tilde{O}(1)$ $U$ -extensions w.h.p. by a union bound. ∎

Lemma 6.11.

If $A$ is a $U$ -base then any $\mathcal{A}\in\mathrm{Sub}_{n}(A)$ has $\tilde{O}(n^{\Delta(U)-\Delta(A)})$ $U$ -extensions w.h.p.

Proof.

Again, assume that $A$ is an induced subgraph of $U$ and condition on $\mathcal{A}\subseteq\mathbf{X}$ . Also assume without loss of generality that $\beta$ is strictly positive on $E(U)$ . The proof is by induction on $v(U)-v(A)$ , for all $U\subseteq G$ . The base case $A=U$ is trivial. Fix an arbitrary vertex $v\in V(U)-V(A)$ . First we consider the case where $\Gamma(A\cup v)\neq U$ . The number of $U$ -extensions of $\mathcal{A}$ equals the sum over all $\gamma\in\{\text{$ \Gamma(A\cup v) $-extensions of$ \mathcal{A} $}\}$ of the number of $U$ -extensions of $\gamma$ . Clearly $A$ is a $\Gamma(A\cup v)$ -base, and Lemma 6.8 implies that $\Gamma(A\cup v)$ is a $U$ -base. It follows from our assumptions that $v(A)<v(\Gamma(A\cup v))<v(U)$ , so we can apply the inductive hypothesis twice: w.h.p. $\mathcal{A}$ has $\tilde{O}(n^{\Delta(\Gamma(A\cup v))-\Delta(A)})$ extensions to $\Gamma(A\cup v)$ , each of which has $\tilde{O}(n^{\Delta(U)-\Delta(\Gamma(A\cup v))})$ extensions to $U$ , and the result follows.

Now assume that $\Gamma(A\cup v)=U$. Lemma 6.10 implies that $\mathcal{A}\cup v_{i}$ has $\tilde{O}(1)$ $U$-extensions w.h.p. for any $i$, so it suffices to show that w.h.p. there are $\tilde{O}(n^{\Delta(U)-\Delta(A)})$ values of $i$ such that $\mathcal{A}\cup v_{i}$ extends to $U$. Let $\mathbf{W}=\mathbf{X}[u_{i}\mid u\in V(U)-v,i\in[n^{\alpha(u)}]]$, and call a fixed graph $W$ “okay” if $\mathcal{A}$ has $\tilde{O}\left(n^{\Delta(U-v)-\Delta(A)}\right)$ extensions to $U-v$ when $\mathbf{W}=W$. Since $A$ is a $(U-v)$-base, $\mathbf{W}$ is okay w.h.p. by the inductive hypothesis. Let $\mathbf{Z}_{i}=\mathbbm{1}\{\mathcal{A}\cup v_{i}\text{ extends to }U\}$, and let $E$ be the event that $\sum_{i}\mathbf{Z}_{i}>\tilde{O}(n^{\Delta(U)-\Delta(A)})$. Then,

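$$P(E)\leq P(\mathbf{W}\text{ is not okay})+\max_{\text{okay }W}P(E\mid\mathbf{W}=W)\leq n^{-\omega(1)}+\max_{\text{okay }W}P(E\mid\mathbf{W}=W),$$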

so it suffices to prove that $P(E\mid\mathbf{W}=W)\leq n^{-\omega(1)}$ for all okay $W$ .

The $\mathbf{Z}_{i}$ are independent Bernoulli random variables (given $W$ ). By a union bound, $\mathbb{E}[\mathbf{Z}_{i}]$ is at most the number of $(U-v)$ -extensions of $\mathcal{A}$ times the probability that the requisite edges between any one of them and $v_{i}$ are in $\mathbf{X}$ , i.e.

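$$\mathbb{E}[\mathbf{Z}_{i}]\leq\tilde{O}\left(n^{\Delta(U-v)-\Delta(A)}\right)\cdot n^{-\sum_{u:\,uv\in E(U)}\beta(uv)}=\tilde{O}\left(n^{\Delta(U)-\Delta(A)-\alpha(v)}\right).$$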

Since $A$ is a $U$ -base, $\Delta(U)-\Delta(A)\geq 0$ , so it follows from Lemma 6.4 that $\sum_{i=1}^{n^{\alpha(v)}}\mathbf{Z}_{i}$ is $\tilde{O}(n^{\Delta(U)-\Delta(A)})$ w.h.p. ∎

Remark.

It follows from Lemma C.1 that Lemma 6.11 is essentially tight.

Now we prove that $\mathbf{X}_{\Delta,n}$ is good w.h.p.:

Proof of Theorem 6.3.

Let $A\subseteq U$, $\mathcal{A}\in\mathrm{Sub}_{n}(A)$ and $v\in V(U)-V(A)$. By a union bound it suffices to prove that w.h.p. there are $\tilde{O}(n^{\Delta^{*}(A\cup v)-\Delta^{*}(A)})$ values of $i$ such that $\mathcal{A}\cup v_{i}$ extends to $U$. The number of such $i$ is at most the number of $i$ such that $\mathcal{A}\cup v_{i}$ extends to $\Gamma(A\cup v)$, which is at most the number of $\Gamma(A\cup v)$-extensions of $\mathcal{A}$. Since $\Gamma(A)\subseteq\Gamma(A\cup v)$ (Lemma 6.9), this equals the sum over all $\gamma\in\{\Gamma(A)\text{-extensions of }\mathcal{A}\}$ of the number $\mathbf{E}_{\gamma}$ of $\Gamma(A\cup v)$-extensions of $\gamma$.

It follows from Lemma 6.10 that $\mathcal{A}$ has $\tilde{O}(1)$ extensions to $\Gamma(A)$ w.h.p. (To see this, note that if $A\subseteq H\subset\Gamma(A)$ then $\Delta(H)\geq\Delta^{*}(A)=\Delta(\Gamma(A))$ (Lemma 6.7), and if $\Delta(H)=\Delta^{*}(A)$ then it follows from the definition of $\Gamma(A)$ that $\Gamma(A)\subseteq H$ , a contradiction.) Since $\Gamma(A)$ is a $\Gamma(A\cup v)$ -base (Lemma 6.8), it follows from Lemma 6.11 that any $\mathbf{E}_{\gamma}$ is $\tilde{O}(n^{\Delta(\Gamma(A\cup v))-\Delta(\Gamma(A))})$ w.h.p. ( $=\tilde{O}(n^{\Delta^{*}(A\cup v)-\Delta^{*}(A)})$ by Lemma 6.7). ∎

6.2 The Circuit

If $D$ is a data structure then let $|D|$ denote the number of bits used to represent it according to whatever schema we describe. If $A$ is a bit array and $b$ is a bit then let $(A\vee b)_{i}=A_{i}\vee b$ and $(A\wedge b)_{i}=A_{i}\wedge b$ for all $i\in[|A|]$ . When there is a null element we represent it by the all-zeros string.

We now prove Theorem 6.1, i.e. that there exists an $\mathrm{AC}^{0}$ circuit with $\tilde{O}(n^{\kappa_{\Delta}(G)+3})$ wires that solves $G\text{-}\mathsf{SUB}(\mathbf{X}_{\Delta,n})$ w.h.p. Since $\mathbf{X}_{\Delta,n}$ is good w.h.p. (Theorem 6.3) it suffices to prove the existence of a small $\mathrm{AC}^{0}$ circuit $\mathsf{C}$ such that $P_{X\sim\mathbf{X}_{\Delta,n}}(\mathsf{C}(X)=G\text{-}\mathsf{SUB}(X)\mid X\text{ is good})=1-n^{-\omega(1)}$. By Yao’s Principle [Yao77] it suffices to prove the existence of a small, random $\mathrm{AC}^{0}$ circuit $\mathbf{C}$ such that $P(\mathbf{C}(X)=G\text{-}\mathsf{SUB}(X))=1-n^{-\omega(1)}$ for all fixed good $X$. More precisely,

[TABLE]

The following result is essentially implicit in [LRR17] (as is the argument above) and helps keep the random circuit small:

Lemma 6.12 (Random Hashing).

Let $S$ be a set containing a null element, and assume all elements of $S$ are represented using the same number of bits. Let $l=l(n)\leq n^{O(1)}$ and $m=m(n)$ be functions of $n$ . Then there exists a random $\mathrm{AC}^{0}$ circuit $\mathbf{C}:S^{l}\rightarrow S^{\tilde{O}(m)}$ such that if $A$ is an array of $l$ values in $S$ , of which all but at most $m$ are null, then $\mathbf{C}$ has at most $|A|n^{o(1)}$ gates and $|A|\tilde{O}(l/m)$ wires, and w.h.p. the multiset of non-null elements of $\mathbf{C}(A)$ is the same as that of $A$ .

We remark that Lemma 6.12 will only be called with $l\leq\tilde{O}(n)$ .

Proof.

The result is trivial if $l\leq m$ (simply return $A$ ) so assume otherwise. Let $\mathbf{h}:[l]\rightarrow[m]$ be a uniform random function. Let $\mathbf{B}$ be an $m\times\tilde{O}(1)$ array of values in $S$ , where $\mathbf{B}[p,q]$ is the $q$ ’th non-null element of $\mathbf{A}^{(p)}\coloneqq A[\mathbf{h}^{-1}(p)]$ if this set has at least $q$ elements, and $\mathbf{B}[p,q]$ is null otherwise. Each of the at most $m$ non-null elements of $A$ is independently in $\mathbf{A}^{(p)}$ with probability $1/m$ , so for any particular $p$ , the sub-array $\mathbf{B}[p,:]$ is large enough to store the non-null elements of $\mathbf{A}^{(p)}$ w.h.p. (Lemma 6.4). It follows from a union bound that $\mathbf{B}$ has the same non-null elements as $A$ w.h.p. Also assume that $|\mathbf{h}^{-1}(p)|$ is $\tilde{O}(l/m)$ for all $p$ ; this occurs w.h.p. by Lemma 6.4. Under these conditions it suffices to compute $\mathbf{B}$ , and this can be done as follows.

For $x\in\{0,1\}^{N}$ let $T_{k}^{N}(x)=\mathbbm{1}\{x\text{ has at least }k\text{ ones}\}$. Then $\mathbf{J}^{(p)}[i]\coloneqq\mathbbm{1}\{\mathbf{A}^{(p)}[i]\neq\mathrm{null}\}$ can be computed by applying a single OR gate to all elements of $\mathbf{A}^{(p)}[i]$, and

[TABLE]

Fact 6.13 ([Hås+94, Theorem 6]).

If $k=\lfloor\log^{\gamma}N\rfloor$ for constant $\gamma$ , then $T_{k}^{N}$ can be computed for $m=\lfloor\gamma\rfloor+1$ by monotone unbounded fan-in circuits of depth $m+2$ with $2^{O(\log^{\gamma/m}N\log\log N)}$ gates, where $\gamma/m<1$ , and $O(N\log^{2\gamma+2}N)$ wires.

Let $N=\tilde{O}(l/m)=n^{O(1)}$, and let $\gamma$ be a constant such that the dimensions of $\mathbf{B}$ are at most $m\times k$ where $k=\lfloor\log^{\gamma}N\rfloor$. Let $\mathsf{T}$ be the $N^{o(1)}$-size (hence $n^{o(1)}$-size) circuit from Fact 6.13 that computes $T_{k}^{N}$. Observe that $T_{q}^{i}(x)=\mathsf{T}(x,y)$ where $y\in\{0,1\}^{N-i}$ is an arbitrary fixed string with exactly $k-q$ ones that can be hard-coded in. Therefore $\mathbf{B}[p,q]$ can be computed by an $\mathrm{AC}^{0}$ circuit of size $\sum_{i\in[|\mathbf{h}^{-1}(p)|]}\left(n^{o(1)}+|\mathbf{A}^{(p)}[i]|\right)\leq\left|\mathbf{A}^{(p)}\right|n^{o(1)}$. Summing over $p$ and $q$, the total number of gates is $|A|n^{o(1)}$. To count wires instead of gates, replace $n^{o(1)}$ with $\tilde{O}(N)=\tilde{O}(l/m)$. ∎
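The bucketing idea behind Lemma 6.12 can be illustrated with a short sequential sketch. This is not the circuit from the proof: the function name `hash_compact`, the explicit row width `k`, and the use of `None` for the null element are assumptions made for illustration, and a plain loop stands in for the threshold gates that locate the $q$'th non-null element of each bucket.

```python
import random

def hash_compact(A, m, k, rng=None):
    """Compact a mostly-null list A into an m x k table using a random hash.

    Returns the table B (m rows of length k), or None if some bucket received
    more than k non-null entries (the low-probability failure event).
    """
    rng = rng or random.Random()
    # h: [l] -> [m], chosen uniformly at random and independently of A.
    h = [rng.randrange(m) for _ in range(len(A))]
    B = [[None] * k for _ in range(m)]
    fill = [0] * m  # number of non-null entries already stored in each row
    for i, x in enumerate(A):
        if x is None:
            continue
        p = h[i]
        if fill[p] == k:
            return None  # bucket p overflowed
        B[p][fill[p]] = x  # x is the (fill[p]+1)'th non-null element of bucket p
        fill[p] += 1
    return B

# Usage: 1000 slots, only three of them non-null, compacted into a 3 x 3 table.
A = [None] * 1000
A[3], A[500], A[999] = "a", "b", "c"
B = hash_compact(A, m=3, k=3, rng=random.Random(0))
```

Because the table here has only $m\cdot k$ cells, later stages can work with an array whose size depends on the number of non-null entries rather than on $l$.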

Given $H\subseteq G$ and an ordering $\pi=(\pi^{1},\dotsc,\pi^{v(H)})$ of $V(H)$, we can represent $\mathrm{Sub}_{X}(H)$ as a tree in the following way. Start with a rooted, depth-$v(H)$ tree (meaning the root has depth 0 and the leaves have depth $v(H)$) in which each interior node has $n$ unordered children labeled $1,\dotsc,n$. Then take the induced subtree of this tree on the union of all root-to-leaf paths $(\mathrm{root},l_{1},\ldots,l_{v(H)})$ such that $\pi^{1}_{l_{1}},\ldots,\pi^{v(H)}_{l_{v(H)}}$ are the vertices of an $H$-colored subgraph of $X$. (Recall that $(\pi^{j})_{l_{j}}$ is a $\pi^{j}$-colored vertex in $X$.)
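As a concrete illustration of this tree representation, the following sketch builds it as a nested dictionary by brute force. The name `sub_tree`, the encoding of a vertex of $X$ as a `(color, index)` pair, and the simplification that every color class has exactly $n$ vertices are assumptions made for the example; the enumeration takes $n^{v(H)}$ steps and is only meant for tiny inputs.

```python
from itertools import product

def sub_tree(order, H_edges, X_edges, n):
    """Nested-dict tree of all H-colored subgraphs of X, read in the given vertex order.

    A root-to-leaf path (l_1, ..., l_v) is kept exactly when, for every edge uv
    of H, the vertices u_{l_u} and v_{l_v} are adjacent in X.
    """
    X = {frozenset(e) for e in X_edges}
    pos = {u: j for j, u in enumerate(order)}
    root = {}
    for labels in product(range(1, n + 1), repeat=len(order)):
        ok = all(
            frozenset({(u, labels[pos[u]]), (v, labels[pos[v]])}) in X
            for u, v in H_edges
        )
        if ok:
            node = root
            for label in labels:
                node = node.setdefault(label, {})
    return root

# Usage: H is the single edge uv, and X contains the edges u_1 v_2 and u_2 v_2.
X_edges = [(("u", 1), ("v", 2)), (("u", 2), ("v", 2))]
tree_uv = sub_tree(["u", "v"], [("u", "v")], X_edges, n=2)  # {1: {2: {}}, 2: {2: {}}}
tree_vu = sub_tree(["v", "u"], [("u", "v")], X_edges, n=2)  # {2: {1: {}, 2: {}}}
```

The two orderings describe the same set of subgraphs, but the shape of the tree (and hence the branching at each depth) depends on $\pi$.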

With respect to an implicit $H$ and $\pi$ , let $\delta_{i}=\Delta^{*}_{H}(\pi^{1}\cup\dotsb\cup\pi^{i})$ for $0\leq i\leq v(H)$ , and let $\phi_{i}=\delta_{i+1}-\delta_{i}$ for $0\leq i<v(H)$ .

Lemma 6.14.

$0\leq\phi_{i}\leq 1$ for all $i$.

Proof.

Clearly $\delta_{i}\leq\delta_{i+1}$ . Let $A\subseteq H$ such that $\pi^{1},\dotsc,\pi^{i}\in V(A)$ and $\Delta(A)=\delta_{i}$ . Then $\delta_{i+1}\leq\Delta(A\cup\pi^{i+1})\leq\Delta(A)+\alpha(\pi^{i+1})\leq\delta_{i}+1$ . ∎
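The quantities $\delta_{i}$ and $\phi_{i}$ can be computed by brute force on small examples. The sketch below assumes that $\Delta^{*}_{H}(A)$ is the minimum of $\Delta(F)$ over all $F$ with $A\subseteq F\subseteq H$, which is how it is used in the proof above; since $\beta\geq 0$, it is enough to minimize over induced subgraphs. The function names and the dictionary encoding of $(\alpha,\beta)$ are hypothetical, and `beta` is expected to contain only the edges of $H$.

```python
from itertools import combinations

def delta(vertices, alpha, beta):
    """Delta of the subgraph induced on `vertices`: total alpha minus total beta."""
    vs = set(vertices)
    edge_weight = sum(w for (u, v), w in beta.items() if u in vs and v in vs)
    return sum(alpha[v] for v in vs) - edge_weight

def delta_star(core, H_vertices, alpha, beta):
    """Minimum of Delta over induced subgraphs of H that contain `core` (assumed definition)."""
    core = set(core)
    rest = [v for v in H_vertices if v not in core]
    return min(
        delta(core | set(extra), alpha, beta)
        for r in range(len(rest) + 1)
        for extra in combinations(rest, r)
    )

def profile(order, alpha, beta):
    """Return ([delta_0, ..., delta_v], [phi_0, ..., phi_{v-1}]) for the ordering `order`."""
    deltas = [delta_star(order[:i], order, alpha, beta) for i in range(len(order) + 1)]
    phis = [deltas[i + 1] - deltas[i] for i in range(len(order))]
    return deltas, phis

# Usage: H is a single edge ab with alpha = 1 on both vertices and beta(ab) = 1.
deltas, phis = profile(["a", "b"], {"a": 1, "b": 1}, {("a", "b"): 1})
# deltas == [0, 1, 1] and phis == [1, 0], so every phi_i lies in [0, 1].
```

In this example the first vertex can be placed in about $n$ ways, while the second is determined up to $\tilde{O}(1)$ choices, matching $\phi_{0}=1$ and $\phi_{1}=0$.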

Let $T=T(H,\pi)$ be a depth- $v(H)$ tree in which each node at depth $i<v(H)$ has $n^{\phi_{i}}\log^{c_{i}}n$ children, where $c_{i}$ is a sufficiently large constant. Each non-root node $N$ has a label $\mathcal{L}(N)\in\{\mathrm{null}\}\cup[n]$ , and the root is labeled “root”. It is required that if we ignore the null nodes of $T$ , then $T$ is isomorphic to the tree representation of $\mathrm{Sub}_{X}(H)$ described above.

If the underlying tree structure of $T$ (that is, everything except the labels) is implicit, then we can represent $T$ by an array of values in $\{\mathrm{null}\}\cup[n]$ , indexed by the nodes of $T$ . Each of these values can be associated with a bit string in a natural way. We will consider circuits that compute $T$ according to this representation.

Let $S$ be an immediate subtree of $T$ (resp. of a node $N$ ), denoted $S\in T$ (resp. $S\in N$ ), if $S$ ’s root is a child of $T$ ’s root (resp. of $N$ ). Any subtree is considered to have the same label as its root.

Lemma 6.15.

$|T|$ is $\tilde{O}(n^{\Delta(H)})$.

Proof.

$\delta_{0}=\Delta(\emptyset)=0$ and $\delta_{v(H)}=\Delta^{*}_{H}(V(H))=\Delta(H)$ . It takes $\tilde{O}(1)$ bits to store an element of $[n]^{V(H)}$ , and each $\phi_{i}$ is nonnegative (Lemma 6.14), so

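$$|T|\leq\tilde{O}(1)\cdot\sum_{d=0}^{v(H)}\prod_{i<d}n^{\phi_{i}}\log^{c_{i}}n\leq\tilde{O}\left(n^{\sum_{i<v(H)}\phi_{i}}\right)=\tilde{O}\left(n^{\delta_{v(H)}-\delta_{0}}\right)=\tilde{O}\left(n^{\Delta(H)}\right).$$ ∎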

Lemma 6.16.

For all $H\subseteq G$ and orderings $\pi,\pi^{\prime}$ of $V(H)$ , there exists a random $\mathrm{AC}^{0}$ circuit, independent of $X$ , with $\tilde{O}(n^{\Delta(H)+2})$ wires, that computes $T(H,\pi^{\prime})$ from $T(H,\pi)$ w.h.p.

Proof.

Assume that $\pi$ and $\pi^{\prime}$ differ only in positions $d$ and $d+1$ . (The general case can be reduced to at most $\binom{v(H)}{2}$ copies of this circuit in succession.) Define $\delta_{i}^{\prime}$ and $\phi_{i}^{\prime}$ analogously to $\delta_{i}$ and $\phi_{i}$ , but with respect to $\pi^{\prime}$ rather than $\pi$ . Clearly $\delta_{i}=\delta_{i}^{\prime}$ for $i\neq d$ , so $\phi_{i}=\phi_{i}^{\prime}$ for $i\notin\{d-1,d\}$ .

For each depth-$(d-1)$ node $N$ of $T(H,\pi)$, in parallel, do the following. For $\sigma\in N,j\in[n]$ let $\tau^{\prime}_{\sigma j}=\bigvee_{\tau\in\sigma}\left((\mathcal{L}(\tau)=j)\wedge\tau^{(\mathcal{L}(\sigma))}\right)$, where $\tau^{(\mathcal{L}(\sigma))}$ is formed from $\tau$ by replacing its (root’s) label with $\mathcal{L}(\sigma)$. Let $\sigma^{\prime}_{j}$ be the tree whose immediate subtrees are $\tau^{\prime}_{\sigma j}$ for $\sigma\in N$, and whose label is $\left(\bigvee_{\sigma\in N}\bigvee_{\tau\in\sigma}(\mathcal{L}(\tau)=j)\right)\wedge\overline{j}$ where $\overline{j}$ is the bit-string representation of $j$. Hash the number of immediate subtrees of $\sigma_{j}^{\prime}$ down to $\tilde{O}(n^{\phi_{d}^{\prime}})$ for each $j$ in parallel, and hash the number of $\sigma_{j}^{\prime}$ down to $\tilde{O}(n^{\phi_{d-1}^{\prime}})$. (The hashing uses Lemma 6.12 and succeeds w.h.p. because $X$ is good; also note that $\phi_{d}+\phi_{d-1}=\delta_{d+1}-\delta_{d-1}=\phi_{d}^{\prime}+\phi_{d-1}^{\prime}$.) Finally, the new children of $N$ are the remaining $\sigma^{\prime}_{j}$.

Computing $\tau^{\prime}_{\sigma j}$ takes $\tilde{O}(\sum_{\tau\in\sigma}|\tau|)=\tilde{O}(|\sigma|)$ wires, so computing $\sigma^{\prime}_{j}$ takes $\tilde{O}(\sum_{\sigma\in N}|\sigma|)=\tilde{O}(|N|)$ wires, and doing this for all $N$ and $j$ takes $\tilde{O}(n|T|)=\tilde{O}(n^{\Delta(H)+1})$ wires (Lemma 6.15). The hashing increases the number of wires by a factor of $\tilde{O}(n)$ . ∎
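The reordering step of Lemma 6.16 has a simple sequential analogue when the tree is stored as a nested dictionary (label to subtree). The sketch below is an illustration under that assumed encoding, not the circuit itself: the name `swap_levels` is hypothetical, dictionaries absorb the null nodes so no hashing is needed, and subtrees below the swapped levels are shared rather than copied.

```python
def swap_levels(tree, d):
    """Return a tree in which the labels at depths d and d+1 (d >= 1) are exchanged."""
    if d > 1:
        # Descend until the current node plays the role of the depth-(d-1) node N.
        return {label: swap_levels(child, d - 1) for label, child in tree.items()}
    swapped = {}
    for a, sigma in tree.items():        # sigma: child of N, labeled a
        for j, tau in sigma.items():     # tau: grandchild of N, labeled j
            # tau is relabeled a and placed under the new child sigma'_j.
            swapped.setdefault(j, {})[a] = tau
    return swapped

# Usage: the tree for ordering (u, v) becomes the tree for ordering (v, u).
tree_uv = {1: {2: {}}, 2: {2: {}}}
tree_vu = swap_levels(tree_uv, 1)  # {2: {1: {}, 2: {}}}
```

Applying the swap twice at the same depth returns the original tree, which reflects that the two orderings encode the same set of subgraphs.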

For $uv\in E(G)$ we can construct $T(uv)$ as follows. Suppose we’re given the adjacency matrix $A\in\{0,1\}^{n^{\alpha(u)}\times n^{\alpha(v)}}$ such that $A_{ij}=\mathbbm{1}\{u_{i}v_{j}\in E(X)\}$. Let $\tau^{\prime}_{ij}=A_{ij}\wedge\overline{i}$. Let $\sigma^{\prime}_{j}$ be the tree with children $\tau^{\prime}_{ij}$ for $i\in[n^{\alpha(u)}]$, and label $\left(\bigvee_{i}A_{ij}\right)\wedge\overline{j}$. This setup is equivalent to the situation immediately before the hashing in the proof of Lemma 6.16, and the rest of the construction is the same. This takes $\tilde{O}(n^{3})$ wires, including the hashing.

Lemma 6.17.

For all $H,H^{\prime}\subseteq G$ and orderings $\pi$ and $\pi^{\prime}$ of $V(H)$ and $V(H^{\prime})$ respectively, there exists a random $\mathrm{AC}^{0}$ circuit, independent of $X$ , with $\tilde{O}(n^{\max(\Delta(H),\Delta(H^{\prime}))+2})$ wires, that computes $T(H\cup H^{\prime},\hat{\pi})$ from $T(H,\pi)$ and $T(H^{\prime},\pi^{\prime})$ w.h.p. for some $\hat{\pi}$ .

Proof.

Let $T=T(H,\pi)$ and $T^{\prime}=T(H^{\prime},\pi^{\prime})$ . By Lemma 6.16 we can assume without loss of generality that $\{\pi^{1},\dotsc,\pi^{v(H\cap H^{\prime})}\}=\{\pi^{\prime 1},\dotsc,\pi^{\prime v(H\cap H^{\prime})}\}=V(H\cap H^{\prime})=V(H)\cap V(H^{\prime})$ , and that $\pi^{k}=\pi^{\prime k}=\hat{\pi}^{k}$ for $k\in[v(H\cap H^{\prime})]$ . Define $\phi^{\prime}$ and $\hat{\phi}$ with respect to $(H^{\prime},\pi^{\prime})$ and $(H\cup H^{\prime},\hat{\pi})$ respectively.

Let $\psi_{i}=\min(\phi_{i},\phi_{i}^{\prime})$. For $0\leq d\leq v(H\cap H^{\prime})$ let $S_{d}$ be a depth-$d$ tree in which each node at depth $i<d$ (including $i=0$) has $\tilde{O}(n^{\psi_{i}})$ children. Again, each non-root node $N$ of $S_{d}$ has a label $\mathcal{L}(N)\in\{\mathrm{null}\}\cup[n]$, and the root is labeled “root”. It is required that if we ignore null nodes, then $S_{d}$ is isomorphic to the intersection of the depth-$d$ truncations of $T$ and $T^{\prime}$. Furthermore, each leaf $\ell$ of $S_{d}$ is associated with the pair $(\sigma,\sigma^{\prime})$ of subtrees of $T$ and $T^{\prime}$ such that the $\mathrm{root}(S_{d})$-to-$\ell$ path in $S_{d}$, the $\mathrm{root}(T)$-to-$\mathrm{root}(\sigma)$ path in $T$, and the $\mathrm{root}(T^{\prime})$-to-$\mathrm{root}(\sigma^{\prime})$ path in $T^{\prime}$ are all the same sequence of labels.

The tree $S_{0}$ is the single node $(T,T^{\prime})$ , and we can compute $S_{d+1}$ from $S_{d}$ by doing the following for each leaf $(\sigma,\sigma^{\prime})$ of $S_{d}$ in parallel. Assume without loss of generality that $\psi_{d}=\phi_{d}$ . (If $\psi_{d}=\phi_{d}^{\prime}$ , reverse the roles of $\sigma$ and $\sigma^{\prime}$ in the following construction.) For $\tau\in\sigma$ let $\rho_{\tau}$ be the immediate subtree of $\sigma^{\prime}$ with the same label as $\tau$ (if this exists), i.e. $\rho_{\tau}=\bigvee_{\tau^{\prime}\in\sigma^{\prime}}((\mathcal{L}(\tau)=\mathcal{L}(\tau^{\prime}))\wedge\tau^{\prime})$ . Replace $(\sigma,\sigma^{\prime})$ with a new node with children $(\rho_{\tau}\neq\mathrm{null})\wedge(\tau,\rho_{\tau})$ for all $\tau\in\sigma$ . Assign the node replacing $(\sigma,\sigma^{\prime})$ the same label as $(\sigma,\sigma^{\prime})$ , and assign $(\tau,\rho_{\tau})$ the same label as $\tau$ and $\rho_{\tau}$ .

Computing $\rho_{\tau}$ takes $\tilde{O}\left(\sum_{\tau^{\prime}\in\sigma^{\prime}}|\tau^{\prime}|\right)=\tilde{O}(|\sigma^{\prime}|)$ wires, and there are at most $n$ values of $\tau$ (Lemma 6.14), so computing $\rho$ takes $\tilde{O}(n|\sigma^{\prime}|)$ wires. Given $\rho$, computing the leaves of the replacement for $(\sigma,\sigma^{\prime})$ takes $O\left(\sum_{\tau\in\sigma}(|\tau|+|\rho_{\tau}|)\right)=O(|\sigma|+|\sigma^{\prime}|)$ wires. Since the roles of $\sigma$ and $\sigma^{\prime}$ might be reversed above, all of this takes at most $\tilde{O}(n|\sigma|+n|\sigma^{\prime}|)$ wires. Since $S_{d}$ has $\tilde{O}\left(n^{\sum_{i<d}\psi_{i}}\right)$ leaves, the number of wires is at most

[TABLE]

Let $S=S_{v(H\cap H^{\prime})}$ . For $d$ from $v(H\cap H^{\prime})-1$ down to 0, for each depth- $d$ node $N$ in $S$ , hash (Lemma 6.12) the number of immediate subtrees of $N$ down from $\tilde{O}(n^{\psi_{d}})$ to $\tilde{O}(n^{\hat{\phi}_{d}})$ , and if all of $N$ ’s children are null and $d>0$ then set $N$ to null. (We remark that $\hat{\phi}_{d}\leq\psi_{d}$ by Lemma C.3.) This takes $\tilde{O}(|S|n)\leq\tilde{O}((|T|+|T^{\prime}|)n)=\tilde{O}(n^{\max(\Delta(H),\Delta(H^{\prime}))+1})$ wires (Lemma 6.15). By induction on $d$ , a node retains its label if and only if it should retain its label in $T(H\cup H^{\prime},\hat{\pi})$ , so the hashing succeeds w.h.p. because $X$ is good.

Finally, for each leaf $(\tau,\tau^{\prime})$ of $S$, append a copy of $\tau^{\prime}$ to each leaf of $\tau$, and put this in place of $(\tau,\tau^{\prime})$ in $S$. This operation is purely semantic and requires no wires. The resulting tree does in fact have the proper number of children per node to be $T(H\cup H^{\prime},(\pi^{1},\dotsc,\pi^{v(H)},\pi^{\prime v(H\cap H^{\prime})+1},\dotsc,\pi^{\prime v(H^{\prime})}))$ by Lemma C.4, but without this knowledge we could instead use hashing on $\tau$ and $\tau^{\prime}$ as above, without knowing whether or not it succeeds vacuously. (For $v(H\cap H^{\prime})\leq k<v(H)$ apply Lemma C.4 with $L=H,R=H^{\prime},A=H[\pi^{1},\dotsc,\pi^{k}],B=H[\pi^{1},\dotsc,\pi^{k+1}],C=A\cap B$, and for $v(H\cap H^{\prime})\leq k<v(H^{\prime})$ apply Lemma C.4 with $L=H^{\prime},R=H,A=H^{\prime}[\pi^{\prime 1},\dotsc,\pi^{\prime k}],B=H^{\prime}[\pi^{\prime 1},\dotsc,\pi^{\prime k+1}],C=H$.) ∎
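The merge in Lemma 6.17 can be pictured as a join of two nested-dictionary trees on their shared top levels. The sketch below is a sequential illustration under assumptions, not the circuit: the names `merge` and `append_to_leaves` are hypothetical, `c` stands for $v(H\cap H^{\prime})$, both trees are assumed to list the shared vertices first and in the same order, and `None` plays the role of a null node.

```python
def append_to_leaves(tau, tau_prime):
    """Attach tau_prime below every leaf of tau (the 'purely semantic' final step)."""
    if not tau:
        return tau_prime
    return {label: append_to_leaves(child, tau_prime) for label, child in tau.items()}

def merge(T, T_prime, c):
    """Join two trees that agree on the vertex order of their first c levels.

    Returns None when no label sequence of length c is common to both trees.
    """
    if c == 0:
        return append_to_leaves(T, T_prime)
    merged = {}
    for label in T.keys() & T_prime.keys():   # labels present in both trees
        sub = merge(T[label], T_prime[label], c - 1)
        if sub is not None:                    # drop nodes whose children are all null
            merged[label] = sub
    return merged or None

# Usage: H is the edge uv (ordering u, v) and H' is the edge uw (ordering u, w).
T = {1: {2: {}}, 2: {2: {}}}
T_prime = {1: {3: {}, 4: {}}, 3: {1: {}}}
joined = merge(T, T_prime, 1)  # {1: {2: {3: {}, 4: {}}}}, ordering (u, v, w)
```

Only $u_{1}$ appears in both trees, so the joined tree keeps that branch and lists each compatible pair of a $v$-vertex and a $w$-vertex below it.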

For each successive $H$ in an optimal union sequence, compute $T(H)$ as described above, and then apply a single OR gate to all leaves of $T(G)$ .

Acknowledgments

Thanks to Benjamin Rossman for introducing me to this topic, and for having many helpful discussions about the research and about drafts of this paper. Thanks to Henry Yuen and the anonymous reviewers for their feedback as well. Part of this work was done while the author was visiting the Simons Institute for the Theory of Computing.

Appendix A Equivalence of Threshold Weightings and Markov Chains

Theorem A.1.

For any threshold weighting $(\alpha,\beta)\in\theta(G)$ there exists a function $M:V(G)\times V(G)\rightarrow\mathbb{R}_{\geq 0}$ such that

1. $M(u,u)=0$ for all $u$,

2. $M(u,v)+M(v,u)=\beta(uv)$ for all $u\neq v$, and

3. $\sum_{v\in V(G)}M(u,v)=\alpha(u)$ for all $u$.

Proof.

Let $\Delta=(\alpha,\beta)$ . The proof is by induction on $v(G)$ . If $G$ is a single vertex $u$ then $\theta(G)$ consists only of $\alpha=0$ , so setting $M(u,u)=0$ satisfies the requirements. Now assume $v(G)>1$ . For $A,B\subseteq G$ let $M(A,B)=\sum_{u\in V(A),v\in V(B)}M(u,v)$ (once $M(u,v)$ is specified). Assume without loss of generality that $G$ is a clique, since we can assign $\beta=0$ on nonexistent edges.

Let $H=\mathrm{argmin}_{F\subset G,0<v(F)<v(G)}\Delta(F)$ , where ties are broken arbitrarily subject to $H$ being an induced subgraph of $G$ . Since $\Delta(G)=0$ ,

[TABLE]

so for $u\in V(H),v\in V(G-H)$ we can define $M(u,v)\in[0,\beta(uv)]$ such that $M(H,G-H)=\Delta(H)$ . For $u\in V(H)$ let $\alpha_{H}(u)=\alpha(u)-M(u,G-H)$ , and let $\Delta_{H}$ be the restriction of $\alpha_{H}-\beta$ to subgraphs of $H$ . For any $\emptyset\subset F\subseteq H$ ,

[TABLE]

with equality if $F=H$ . Therefore $\Delta_{H}$ is a threshold weighting on $H$ . Recursively define a restriction of $M$ to $V(H)\times V(H)$ such that this restriction is a Markov Chain on $H$ that is equivalent to $\Delta_{H}$ .

For $u\in V(G-H),v\in V(H)$ let $M(u,v)=\beta(uv)-M(v,u)$ . For $u\in V(G-H)$ let $\alpha_{G-H}(u)=\alpha(u)-M(u,H)$ , and let $\Delta_{G-H}$ be the restriction of $\alpha_{G-H}-\beta$ to subgraphs of $G-H$ . Then,

[TABLE]

For any $\emptyset\subset F\subset G-H$ , if $v(F)<v(G-H)$ then

[TABLE]

and if $v(F)=v(G-H)$ then $\Delta_{G-H}(F)\geq\Delta_{G-H}(G-H)=0$ . Therefore $\Delta_{G-H}$ is a threshold weighting on $G-H$ . Recursively define a restriction of $M$ to $V(G-H)\times V(G-H)$ such that this restriction is a Markov Chain on $G-H$ that is equivalent to $\Delta_{G-H}$ .

We now verify that $M(u,G)=\alpha(u)$ for all $u$ ; the other requirements follow easily by induction. If $u\in V(H)$ then $M(u,H)=\alpha_{H}(u)$ by induction, and $M(u,G-H)=\alpha(u)-\alpha_{H}(u)$ by the definition of $\alpha_{H}$ . Similarly, if $u\in V(G-H)$ then $M(u,G-H)=\alpha_{G-H}(u)$ and $M(u,H)=\alpha(u)-\alpha_{G-H}(u)$ . Therefore $M(u,G)=M(u,H)+M(u,G-H)=\alpha(u)$ for all $u$ . ∎
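The three requirements of Theorem A.1 are easy to check mechanically for a candidate $M$. The following sketch does so for small examples; the function name `is_equivalent_chain` and the dictionary encoding (missing entries count as zero) are assumptions made for illustration.

```python
def is_equivalent_chain(M, alpha, beta):
    """Check the conditions of Theorem A.1 for M against the weighting (alpha, beta)."""
    V = list(alpha)

    def m(u, v):
        return M.get((u, v), 0)

    def b(u, v):
        # beta is symmetric; nonexistent edges carry weight 0.
        return beta.get((u, v), beta.get((v, u), 0))

    nonnegative = all(m(u, v) >= 0 for u in V for v in V)
    no_self_loops = all(m(u, u) == 0 for u in V)
    splits_beta = all(m(u, v) + m(v, u) == b(u, v) for u in V for v in V if u != v)
    rows_sum_to_alpha = all(sum(m(u, v) for v in V) == alpha[u] for u in V)
    return nonnegative and no_self_loops and splits_beta and rows_sum_to_alpha

# Usage: the triangle with alpha = 1 on vertices and beta = 1 on edges, for which
# Delta(G) = 3 - 3 = 0 and Delta is nonnegative on every subgraph.
alpha = {"a": 1, "b": 1, "c": 1}
beta = {("a", "b"): 1, ("b", "c"): 1, ("a", "c"): 1}
cyclic = {("a", "b"): 1, ("b", "c"): 1, ("c", "a"): 1}
ok = is_equivalent_chain(cyclic, alpha, beta)  # True
```

Here each edge's weight is assigned entirely to one direction around the cycle, so every vertex sends out total weight $1=\alpha(u)$.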

Appendix B Proof that $\kappa\left(K_{q}^{d}\right)$ is $O(q^{d}/d)$ for all $q$

The proof below is self-contained; however in places with clear analogues in Section 5.2 we will give less detailed explanations of the intermediate steps and intuition.

Fix $q$ and $d$ . Let a query tree be a binary tree in which each node is labeled with some $U_{1}\times\dotsb\times U_{d}$ where each $U_{i}\subseteq[q]$ . The root is labeled with $[q]^{d}$ , each leaf is labeled with a singleton set, and for any interior node $N$ labeled with $U_{1}\times\dotsb\times U_{d}$ there exist $i\in[d]$ and $k\in U_{i}$ such that the left and right children of $N$ are labeled with $U_{1}\times\dotsb\times U_{i-1}\times(U_{i}-k)\times U_{i+1}\times\dotsb\times U_{d}$ and $U_{1}\times\dotsb\times U_{i-1}\times\{k\}\times U_{i+1}\times\dotsb\times U_{d}$ respectively. (In the latter case, $U_{i}$ necessarily has at least two elements.)
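To make the definition concrete, the sketch below enumerates the leaves of one particular query tree from left to right. The function names and the splitting policy `first_max` (always split the first non-singleton coordinate at its largest remaining value) are assumptions chosen for illustration; any other rule for picking $(i,k)$ would give another valid query tree.

```python
def query_tree_leaves(U, choose):
    """Leaves of the query tree rooted at the box U_1 x ... x U_d, left to right.

    U is a list of sets; choose(U, splittable) returns the pair (i, k) used to
    split a node, where i is a coordinate with |U_i| >= 2 and k is in U_i.
    """
    splittable = [i for i, Ui in enumerate(U) if len(Ui) >= 2]
    if not splittable:
        # Every U_i is a singleton, so this node is a leaf.
        return [tuple(next(iter(Ui)) for Ui in U)]
    i, k = choose(U, splittable)
    left = U[:i] + [U[i] - {k}] + U[i + 1:]   # coordinate i restricted to U_i - k
    right = U[:i] + [{k}] + U[i + 1:]         # coordinate i restricted to {k}
    return query_tree_leaves(left, choose) + query_tree_leaves(right, choose)

def first_max(U, splittable):
    """Split the first non-singleton coordinate at its largest remaining value."""
    i = splittable[0]
    return i, max(U[i])

# Usage: q = 3, d = 2, with coordinates numbered from 0.
leaves = query_tree_leaves([set(range(3)), set(range(3))], first_max)
```

With this policy the leaves come out in lexicographic order, and each prefix $\ell_{0},\dotsc,\ell_{a-1}$ of the list is a vertex set of the kind considered in the rest of this appendix.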

With respect to an implicit query tree $T$, let $\ell_{0},\dotsc,\ell_{q^{d}-1}$ be the leaves in increasing order from left to right, and for $0\leq a\leq q^{d}$ let $G(a)=K_{q}^{d}[\ell_{0},\dotsc,\ell_{a-1}]$. Let $\mu_{T}=\max_{a}\Delta_{\mathrm{o}}(G(a))$ and let $\mu$ be the maximum of $\mu_{T}$ over all query trees $T$. For a threshold weighting $\Delta\in\theta\left(K_{q}^{d}\right)$ and $H\subseteq K_{q}^{d}$ let $\kappa_{\Delta}(H)=\min_{S\in\mathrm{Seq}(H)}\max_{F\in S}\Delta(F)$, and if $H$ is a single-vertex graph or the empty graph then let $\kappa_{\Delta}(H)=0$. By Item 3.8(i) it suffices to prove that $\kappa_{\Delta}\left(K_{q}^{d}\right)$ is $O(q^{d}/d)$ for all threshold weightings $\Delta=(1,\beta)$.

Lemma B.1.

Fix a query tree $T$. Let $0\leq a<b\leq q^{d}$ such that $\ell_{a},\dotsc,\ell_{b-1}$ are exactly the leaves descended from some node of $T$. Let $\Delta=(1,\beta)\in\theta\left(K_{q}^{d}\right)$ such that $\beta(G(a))\geq\beta_{\mathrm{o}}(G(a))$ and $\beta(G(b))\geq\beta_{\mathrm{o}}(G(b))$, and $\kappa_{\Delta}(G(a))\leq 2\mu$. Then $\kappa_{\Delta}(G(b))\leq 2\mu$.

Proof.

Let $N$ be the node of $T$ such that the leaves descended from $N$ are exactly $\ell_{a},\dotsc,\ell_{b-1}$ . Let $U_{1}\times\dotsb\times U_{d}$ be the label of $N$ . Let $B=G(b)-G(a)=K_{q}^{d}[U_{1}\times\dotsb\times U_{d}]=K_{q}^{d}[\ell_{a},\dotsc,\ell_{b-1}]$ .

The proof is by induction on $\sum_{i}(|U_{i}|-1)$ , for all query trees $T$ . (It follows from the definitions that $|U_{i}|\geq 1$ , with equality for all $i$ if and only if $N$ is a leaf.) In the inductive step we handle separately the cases where $\beta(B)\geq\beta_{\mathrm{o}}(B)$ and $\beta(B)<\beta_{\mathrm{o}}(B)$ . The base case is a special case of the former because if $B$ is a single vertex then $\beta(B)$ and $\beta_{\mathrm{o}}(B)$ are both zero.

Case 1: $\beta(B)\geq\beta_{\mathrm{o}}(B)$ . If $B$ is a single vertex then $\kappa_{\Delta}(B)=0\leq 2\mu$ ; we now obtain the same result in the case where $B$ is not a single vertex. Let $\mathcal{I}=\{i\in[d]\mid|U_{i}|\geq 2\}$ , and note that $\mathcal{I}$ is nonempty. For $i\in\mathcal{I}$ and $k\in U_{i}$ let $B(i,k)=B[v\in V(B)\mid v_{i}\neq k]$ . Choose a pair $(\mathbf{i},\mathbf{k})$ uniformly at random out of all pairs $(i,k)$ such that $i\in\mathcal{I}$ and $k\in U_{i}$ . Each edge in $B$ is also in $B(\mathbf{i},\mathbf{k})$ with the same probability $p=1-(|\mathcal{I}|+1)/\sum_{i\in\mathcal{I}}|U_{i}|$ (since adjacent vertices differ in a unique coordinate), so by linearity of expectation,

[TABLE]

Therefore $\beta(B(i,k))\geq\beta_{\mathrm{o}}(B(i,k))$ for some fixed $i$ and $k$ .

Now our claim that $\kappa_{\Delta}(B)\leq 2\mu$ follows from two applications of the inductive hypothesis. Let $T^{\prime}$ be any query tree in which the sequence of labels along the path from the root to the leftmost leaf includes $U_{1}\times\dotsb\times U_{d}$ followed by $U_{1}\times\dotsb\times(U_{i}-k)\times\dotsb\times U_{d}$ . With respect to $T^{\prime}$ , an application of the inductive hypothesis with $a^{\prime}=0$ and $b^{\prime}=(1-1/|U_{i}|)\prod_{j}|U_{j}|$ reveals that $\kappa_{\Delta}(B(i,k))\leq 2\mu$ , and then an application of the inductive hypothesis with $a^{\prime\prime}=(1-1/|U_{i}|)\prod_{j}|U_{j}|$ and $b^{\prime\prime}=\prod_{j}|U_{j}|$ ( $=b-a$ ) reveals that $\kappa_{\Delta}(B)\leq 2\mu$ .

The rest of the proof is essentially identical to the case $q=2$ . Let $S$ be an optimal (with respect to $\Delta$ ) union sequence for $G(a)$ , followed by an optimal union sequence for $B$ , followed by $G(a)\cup B,G(a)\cup B\cup e_{1},\dotsc,G(a)\cup B\cup\{e_{j}\}$ , where the $\{e_{j}\}$ are the edges between $G(a)$ and $B$ in $K_{q}^{d}$ . (If $G(a)$ or $B$ lacks edges then omit certain graphs from this sequence.) Then,

[TABLE]

We proceed to bound each of these three terms by $2\mu$ , completing the proof. We have assumed that $\kappa_{\Delta}(G(a))\leq 2\mu$ , and proved that $\kappa_{\Delta}(B)\leq 2\mu$ . We have also assumed that $\beta(G(a))\geq\beta_{\mathrm{o}}(G(a))$ , and since $\Delta$ and $\Delta_{\mathrm{o}}$ both evaluate to 1 on all vertices, it follows that $\Delta(G(a))\leq\Delta_{\mathrm{o}}(G(a))\leq\mu$ (with the last step following from the definition of $\mu$ ). Similarly, $\Delta(B)\leq\Delta_{\mathrm{o}}(B)\leq\mu$ , and it follows that $\Delta(G(a))+\Delta(B)\leq 2\mu$ .

Case 2: $\beta(B)<\beta_{\mathrm{o}}(B)$ . For $i\in\mathcal{I}$ and $k\in U_{i}$ let $H(i,k)=K_{q}^{d}[\ell_{0},\dotsc,\ell_{a-1},V(B(i,k))]$ (where $\mathcal{I}$ and $B(i,k)$ are defined as above). Choose a pair $(\mathbf{i},\mathbf{k})$ uniformly at random out of all pairs $(i,k)$ such that $i\in\mathcal{I}$ and $k\in U_{i}$ . Then there exist $p_{0}>p_{1}>p_{2}\geq 0$ (specifically, $p_{0}=1$ , $p_{1}=1-|\mathcal{I}|/\sum_{i\in\mathcal{I}}|U_{i}|$ , and $p_{2}=1-(|\mathcal{I}|+1)/\sum_{i\in\mathcal{I}}|U_{i}|$ ) such that

[TABLE]

Therefore $\beta(H(i,k))>\beta_{\mathrm{o}}(H(i,k))$ for some fixed $i$ and $k$ .

Preparing to apply the inductive hypothesis, let $T^{\prime\prime}$ be any query tree structured and labeled exactly like $T$ on all ancestors of $\ell_{j}$ for all $j<a$ , and on all ancestors of $N$ , but now the left child of $N$ is labeled with $U_{1}\times\dotsb\times(U_{i}-k)\times\dotsb\times U_{d}$ . With respect to $T^{\prime\prime}$ , an application of the inductive hypothesis with $a^{\prime}=a$ and $b^{\prime}=a+(1-1/|U_{i}|)\prod_{j}|U_{j}|$ reveals that $\kappa_{\Delta}(G(a+(1-1/|U_{i}|)\prod_{j}|U_{j}|))\leq 2\mu$ , and a second application of the inductive hypothesis with $a^{\prime\prime}=a+(1-1/|U_{i}|)\prod_{j}|U_{j}|$ and $b^{\prime\prime}=b$ ( $=a+\prod_{j}|U_{j}|$ ) reveals that $\kappa_{\Delta}(G(b))\leq 2\mu$ . ∎

Lemma B.2.

$\mu$ is $O(q^{d}/d)$.

Proof.

We use a cruder bound here than in the case $q=2$ . Let $T$ be an arbitrary query tree and $a\in[q^{d}]$ . Let $N_{0}$ be the nearest common ancestor of $\ell_{0}$ and $\ell_{a-1}$ . If $N_{0}$ is a leaf then clearly $\Delta_{\mathrm{o}}(G(a))$ is $O(q^{d}/d)$ , so assume otherwise. Let $N_{L}$ and $N_{R}$ be the left and right children of $N_{0}$ , and note that $\ell_{a-1}$ is a descendant of $N_{R}$ . By Eq. 1, since $K_{q}^{d}$ is $(q-1)d$ -regular it suffices to prove that $e(G(a),K_{q}^{d}-G(a))$ is $O(q^{d+1})$ . Suppose $N_{0}$ is labeled with $U_{1}\times\dotsb\times U_{d}$ and $N_{L}$ is labeled with $U_{1}\times\dotsb\times(U_{i}-k)\times\dotsb\times U_{d}$ . Since each vertex in $G(a)$ is a descendant of $N_{0}$ , all edges between $G(a)$ and $K_{q}^{d}-G(a)$ are in one of the following classes:

1. Edges (in $K_{q}^{d}$) between a leaf descended from $N_{0}$ and a leaf not descended from $N_{0}$. Each leaf descended from $N_{0}$ is adjacent to $\sum_{j=1}^{d}(q-|U_{j}|)$ leaves not descended from $N_{0}$, so this amounts to $(dq-\sum_{j}|U_{j}|)\prod_{j}|U_{j}|$ edges in total. By the AM-GM inequality, this is at most

[TABLE]

2. Edges (in $K_{q}^{d}$) between a leaf descended from $N_{L}$ and a leaf descended from $N_{R}$. Each leaf descended from $N_{L}$ is adjacent to one leaf descended from $N_{R}$, so this amounts to at most $\prod_{j}|U_{j}|\leq q^{d}$ edges in total.

3. Edges (in $K_{q}^{d}$) between a leaf descended from $N_{R}$ that’s in $G(a)$, and a leaf descended from $N_{R}$ that’s in $K_{q}^{d}-G(a)$. This is at most what the value of $\mu$ would be if $d$ were $d-1$ instead. (Eliminate coordinate $i$, and replace $U_{j}$ with $[q]$ for all $j\neq i$.)

The total number of edges in all classes is therefore $O(q^{d+1}+q^{d}+\dotsb)=O(q^{d+1})$ . ∎
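The quantity $e(G(a),K_{q}^{d}-G(a))$ bounded in this proof can be computed directly on small instances. The sketch below is only a numerical illustration: the name `cut_sizes` is hypothetical, the count is quadratic in the number of vertices, and the leaf order used in the example is lexicographic order, which is assumed here to arise from the query tree that always splits the first non-singleton coordinate at its largest remaining value.

```python
from itertools import product

def cut_sizes(leaves):
    """cuts[a] = number of edges of K_q^d between the first a leaves and the rest.

    Two vertices of K_q^d are adjacent when they differ in exactly one coordinate.
    """
    def adjacent(x, y):
        return sum(s != t for s, t in zip(x, y)) == 1

    cuts = []
    for a in range(len(leaves) + 1):
        inside, outside = leaves[:a], leaves[a:]
        cuts.append(sum(adjacent(x, y) for x in inside for y in outside))
    return cuts

# Usage: q = 2, d = 2 (a 4-cycle) with leaves in lexicographic order.
leaves = list(product(range(2), repeat=2))
cuts = cut_sizes(leaves)  # [0, 2, 2, 2, 0]
```

For such small parameters the cut is far below $q^{d+1}$; the example only shows what is being counted, not the asymptotic bound.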

Finally, it follows from Lemmas B.1 and B.2 that $\kappa\left(K_{q}^{d}\right)\leq 2\mu\leq O(q^{d}/d)$.

Remark.

The above argument holds even if we relax the definition of threshold weightings to allow $\Delta$ to take on negative values (where all definitions in terms of threshold weightings are with respect to this revised definition).

Appendix C Properties of Threshold Weightings and Threshold Random Graphs

Lemma C.1.

If $A\subseteq U\subseteq G$ are fixed graphs, $\Delta\in\theta(G)$ , and $\Delta(A)<\Delta(H)$ for all $H\in(A,U]$ , then conditional on $\mathcal{A}\in\mathrm{Sub}_{\mathbf{X}_{\Delta,n}}(A)$ , there are at least $n^{\Delta(U)-\Delta(A)}(1-o(1))$ $U$ -extensions of $\mathcal{A}$ a.a.s.

Li et al. [LRR17] stated without proof that a similar result can be obtained using Janson’s Inequality [Jan90]:

Fact C.2 (Janson’s Inequality).

Let $\mathbf{B}_{1},\ldots,\mathbf{B}_{\ell}$ be independent Bernoulli random variables, let $W_{1},\ldots,W_{k}\subseteq[\ell]$ , and for $i\in[k]$ let $\mathbf{I}_{i}=\prod_{j\in W_{i}}\mathbf{B}_{j}$ . Also for $i,j\in[k],i\neq j$ let $i\sim j$ if $W_{i}\cap W_{j}\neq\emptyset$ . Let $\mathbf{S}=\sum_{i}\mathbf{I}_{i}$ and $\mu=\mathbb{E}[\mathbf{S}]$ . Then for all $0\leq\epsilon\leq 1$ ,

[TABLE]

Proof of Lemma C.1.

Let $\mathcal{U}_{1},\dotsc,\mathcal{U}_{k}$ be the possible $U$ -extensions of $\mathcal{A}$ , and let $\mathbf{I}_{i}=\mathbbm{1}\!\{\mathcal{U}_{i}\subseteq\mathbf{X}\}$ . Define $\mu$ as in Fact C.2; clearly $\mu=n^{\Delta(U)-\Delta(A)}$ , by reasoning similar to the proof of Lemma 3.5. If $i\sim j$ then the projection of $\mathcal{U}_{i}\cap\mathcal{U}_{j}$ onto $U$ must be some graph in $(A,U)$ , so

[TABLE]

Since $\mu$ is also $o(\mu^{2})$, it follows that $\mu^{2}\big/\left(\mu+\sum_{i\sim j}\mathbb{E}[\mathbf{I}_{i}\mathbf{I}_{j}]\right)\geq\mu^{2}/o(\mu^{2})=\omega(1)$, and the result follows from Fact C.2. ∎

Lemma C.3.

For all $A\subseteq B\subseteq F\subseteq H\subseteq G$ and $\Delta\in\theta(G)$ ,

[TABLE]

Proof.

Since $B\subseteq\Gamma_{F}(B)\subseteq\Gamma_{H}(A)\cup\Gamma_{F}(B)$ it follows that $\Delta^{*}_{H}(B)\leq\Delta(\Gamma_{H}(A)\cup\Gamma_{F}(B))$ , and since $A\subseteq\Gamma_{H}(A)$ and $A\subseteq B\subseteq\Gamma_{F}(B)$ it follows that $\Delta^{*}_{F}(A)\leq\Delta(\Gamma_{H}(A)\cap\Gamma_{F}(B))$ . So by Lemmas 6.5 and 6.7,

[TABLE]

Lemma C.4.

Let $\Delta\in\theta(G)$ and assume $L\cap R\subseteq A\subseteq B\subseteq L\subseteq G$ and $L\cap R\subseteq C\subseteq R\subseteq G$ . Then, $\Delta^{*}_{L\cup R}(B\cup C)-\Delta^{*}_{L\cup R}(A\cup C)=\Delta^{*}_{L}(B)-\Delta^{*}_{L}(A)$ .

Proof.

For all $F\in[A,L]$ and $H\in[C,R]$,

[TABLE]

so by Lemma 6.5,

[TABLE]

The same reasoning applies with $B$ in place of $A$, so

[TABLE]

Bibliography

[AM11] Noga Alon and Dániel Marx “Sparse balanced partitions and the complexity of subgraph problems” In SIAM J. Discrete Math. 25.2, 2011, pp. 631–644 DOI: 10.1137/100812653
[AM85] Noga Alon and Vitali D. Milman “$\lambda_{1}$, isoperimetric inequalities for graphs, and superconcentrators” In J. Combin. Theory Ser. B 38.1, 1985, pp. 73–88 DOI: 10.1016/0095-8956(85)90092-9
[Ama10] Kazuyuki Amano “$k$-subgraph isomorphism on $\mathrm{AC}^{0}$ circuits” In Comput. Complexity 19.2, 2010, pp. 183–210 DOI: 10.1007/s00037-010-0288-y
[AYZ95] Noga Alon, Raphael Yuster and Uri Zwick “Color-coding” In J. ACM 42.4, 1995, pp. 844–856 DOI: 10.1145/210332.210337
[BK08] Hans L. Bodlaender and Arie M… Koster “Combinatorial optimization on graphs of bounded treewidth” In Comput. J. 51.3, OUP, 2008, pp. 255–269 DOI: 10.1093/comjnl/bxm037
[Bod98] Hans L. Bodlaender “A partial $k$-arboretum of graphs with bounded treewidth” In Theoret. Comput. Sci. 209.1-2, 1998, pp. 1–45 DOI: 10.1016/S0304-3975(97)00228-4
[CK06] L. Chandran and Telikepalli Kavitha “The treewidth and pathwidth of hypercubes” In Discrete Math. 306.3, 2006, pp. 359–365 DOI: 10.1016/j.disc.2005.12.011
[EG04] Friedrich Eisenbrand and Fabrizio Grandoni “On the complexity of fixed parameter clique and dominating set” In Theoret. Comput. Sci. 326.1-3, 2004, pp. 57–67 DOI: 10.1016/j.tcs.2004.05.009
