Anticoncentration for subgraph counts in random graphs

Jacob Fox; Matthew Kwan; Lisa Sauermann

arXiv:1905.12749·math.CO·November 19, 2020

Anticoncentration for subgraph counts in random graphs

Jacob Fox, Matthew Kwan, Lisa Sauermann

PDF

TL;DR

This paper establishes near-optimal bounds on the probability that the count of a fixed subgraph in a random graph falls into a specific small value, advancing understanding of small-ball probabilities in random graph theory.

Contribution

It introduces a novel anticoncentration inequality for almost linear random vectors and applies it to bound subgraph count probabilities in Erdős–Rényi graphs.

Findings

01

Proves that for connected graphs, the probability of exactly x copies is at most n^{1-v(H)+o(1)}.

02

Develops a new anticoncentration inequality for vectors with near-linear behavior.

03

Provides a method to analyze small-ball probabilities for subgraph counts.

Abstract

Fix a graph $H$ and some $p \in (0, 1)$ , and let $X_{H}$ be the number of copies of $H$ in a random graph $G (n, p)$ . Random variables of this form have been intensively studied since the foundational work of Erd\H{o}s and R\'{e}nyi. There has been a great deal of progress over the years on the large-scale behaviour of $X_{H}$ , but the more challenging problem of understanding the small-ball probabilities has remained poorly understood until now. More precisely, how likely can it be that $X_{H}$ falls in some small interval or is equal to some particular value? In this paper we prove the almost-optimal result that if $H$ is connected then for any $x \in N$ we have $Pr (X_{H} = x) \leq n^{1 - v (H) + o (1)}$ . Our proof proceeds by iteratively breaking $X_{H}$ into different components which fluctuate at "different scales", and relies on a new anticoncentration inequality for random vectors that behave…

Equations186

\max_{x\in\mathbb{N}}\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)=O\mathopen{}\mathclose{{}\left(n^{1-v\mathopen{}\mathclose{{}\left(H}\right)}}\right),

\max_{x\in\mathbb{N}}\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)=O\mathopen{}\mathclose{{}\left(n^{1-v\mathopen{}\mathclose{{}\left(H}\right)}}\right),

\max_{x\in\mathbb{Z}}\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)=n^{1-v\mathopen{}\mathclose{{}\left(H}\right)+o\mathopen{}\mathclose{{}\left(1}\right)}.

\max_{x\in\mathbb{Z}}\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)=n^{1-v\mathopen{}\mathclose{{}\left(H}\right)+o\mathopen{}\mathclose{{}\left(1}\right)}.

\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)=\boldsymbol{f}\mathopen{}\mathclose{{}\left(\xi_{1},\dots,\xi_{i-1},1,\xi_{i+1}.\dots,\xi_{n}}\right)-\boldsymbol{f}\mathopen{}\mathclose{{}\left(\xi_{1},\dots,\xi_{i-1},0,\xi_{i+1},\dots,\xi_{n}}\right).

\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)=\boldsymbol{f}\mathopen{}\mathclose{{}\left(\xi_{1},\dots,\xi_{i-1},1,\xi_{i+1}.\dots,\xi_{n}}\right)-\boldsymbol{f}\mathopen{}\mathclose{{}\left(\xi_{1},\dots,\xi_{i-1},0,\xi_{i+1},\dots,\xi_{n}}\right).

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\leq c\cdot\mathopen{}\mathclose{{}\left(\frac{r\sqrt{\log n}}{s}}\right)^{d}.

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\leq c\cdot\mathopen{}\mathclose{{}\left(\frac{r\sqrt{\log n}}{s}}\right)^{d}.

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)\leq c_{p}\cdot n_{j}^{-1/2}\leq(c_{p}/\sqrt{\varepsilon})\cdot n^{-1/2}

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)\leq c_{p}\cdot n_{j}^{-1/2}\leq(c_{p}/\sqrt{\varepsilon})\cdot n^{-1/2}

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)>n^{-2d}\text{ if and only if }a_{j}\leq t\leq b_{j}.

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)>n^{-2d}\text{ if and only if }a_{j}\leq t\leq b_{j}.

p n_{j} - d n lo g n \leq a_{j} \leq b_{j} \leq p n_{j} + d n lo g n

p n_{j} - d n lo g n \leq a_{j} \leq b_{j} \leq p n_{j} + d n lo g n

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|<a_{j}\text{ or }\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|>b_{j}}\right)=\sum_{t=0}^{a_{j}-1}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)+\sum_{t=b_{j}+1}^{n^{j}}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)\leq n\cdot n^{-2d}=n^{-2d+1}

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|<a_{j}\text{ or }\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|>b_{j}}\right)=\sum_{t=0}^{a_{j}-1}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)+\sum_{t=b_{j}+1}^{n^{j}}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)\leq n\cdot n^{-2d}=n^{-2d+1}

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}\,\,\bigg{|}\,\,\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}\text{ for }j=1,\dots,d}\right)\\ =\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right).

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}\,\,\bigg{|}\,\,\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}\text{ for }j=1,\dots,d}\right)\\ =\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right).

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}\text{ and }\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}\text{ for }j=1,\dots,d}\right)\\ =\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\cdot\prod_{j=1}^{d}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}}\right)\\ \leq(c_{p}/\sqrt{\varepsilon})^{d}\cdot n^{-d/2}\cdot\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}\text{ and }\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}\text{ for }j=1,\dots,d}\right)\\ =\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\cdot\prod_{j=1}^{d}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}}\right)\\ \leq(c_{p}/\sqrt{\varepsilon})^{d}\cdot n^{-d/2}\cdot\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|<a_{j}\text{ or }\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|>b_{j}\text{ for some }1\leq j\leq d}\right)\leq d\cdot n^{-2d+1}\leq n^{-d}.

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|<a_{j}\text{ or }\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|>b_{j}\text{ for some }1\leq j\leq d}\right)\leq d\cdot n^{-2d+1}\leq n^{-d}.

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\\ \leq n^{-d}+\sum_{(t_{1},\dots,t_{d})}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}\text{ and }\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}\text{ for }j=1,\dots,d}\right)\\ \leq n^{-d}+(c_{p}/\sqrt{\varepsilon})^{d}\cdot n^{-d/2}\cdot\sum_{(t_{1},\dots,t_{d})}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right),

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\\ \leq n^{-d}+\sum_{(t_{1},\dots,t_{d})}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}\text{ and }\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}\text{ for }j=1,\dots,d}\right)\\ \leq n^{-d}+(c_{p}/\sqrt{\varepsilon})^{d}\cdot n^{-d/2}\cdot\sum_{(t_{1},\dots,t_{d})}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right),

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\leq n^{-d}+(c_{p}/\sqrt{\varepsilon})^{d}\cdot n^{-d/2}\cdot\mathbb{E}Y,

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\leq n^{-d}+(c_{p}/\sqrt{\varepsilon})^{d}\cdot n^{-d/2}\cdot\mathbb{E}Y,

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r}\right)=\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\boldsymbol{\xi}}}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r\,\,\bigg{|}\,\,\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}\text{ for }j=1,\dots,d}\right)\\ \leq\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\boldsymbol{\xi}}}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r}\right)\cdot\prod_{j=1}^{d}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}}\right)^{-1}\leq n^{-6d^{2}}\cdot\prod_{j=1}^{d}n^{2d}\leq n^{-4d^{2}}\leq n^{-2d-2},

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r}\right)=\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\boldsymbol{\xi}}}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r\,\,\bigg{|}\,\,\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}\text{ for }j=1,\dots,d}\right)\\ \leq\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\boldsymbol{\xi}}}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r}\right)\cdot\prod_{j=1}^{d}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}}\right)^{-1}\leq n^{-6d^{2}}\cdot\prod_{j=1}^{d}n^{2d}\leq n^{-4d^{2}}\leq n^{-2d-2},

f (χ (t_{1}, \dots, t_{j^{*} - 1}, t_{j^{*}} + 1, t_{j^{*} + 1}, \dots, t_{d})) - f (χ (t_{1}, \dots, t_{d})) = Δ_{σ_{j^{*}} (t_{j^{*}} + 1)} f (χ (t_{1}, \dots, t_{d})) .

f (χ (t_{1}, \dots, t_{j^{*} - 1}, t_{j^{*}} + 1, t_{j^{*} + 1}, \dots, t_{d})) - f (χ (t_{1}, \dots, t_{d})) = Δ_{σ_{j^{*}} (t_{j^{*}} + 1)} f (χ (t_{1}, \dots, t_{d})) .

\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\chi}(t_{1},\dots,t_{j^{*}-1},t_{j^{*}}+1,t_{j^{*}+1},\dots,t_{d}))-\boldsymbol{f}(\boldsymbol{\chi}(t_{1},\dots,t_{d}))-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\leq r

\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\chi}(t_{1},\dots,t_{j^{*}-1},t_{j^{*}}+1,t_{j^{*}+1},\dots,t_{d}))-\boldsymbol{f}(\boldsymbol{\chi}(t_{1},\dots,t_{d}))-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\leq r

\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\chi}(t_{1},\dots,,t_{d}))-\boldsymbol{f}(\boldsymbol{\chi}(a_{1},\dots,a_{d}))-(t_{1}-a_{1})s\boldsymbol{v}_{1}-\dots-(t_{d}-a_{d})s\boldsymbol{v}_{d}}\right\|_{\infty}\\ \leq((t_{1}-a_{1})+\dots+(t_{d}-a_{d}))\cdot r\leq 2d^{2}\sqrt{n\log n}\cdot r,

\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\chi}(t_{1},\dots,,t_{d}))-\boldsymbol{f}(\boldsymbol{\chi}(a_{1},\dots,a_{d}))-(t_{1}-a_{1})s\boldsymbol{v}_{1}-\dots-(t_{d}-a_{d})s\boldsymbol{v}_{d}}\right\|_{\infty}\\ \leq((t_{1}-a_{1})+\dots+(t_{d}-a_{d}))\cdot r\leq 2d^{2}\sqrt{n\log n}\cdot r,

\mathopen{}\mathclose{{}\left\|\boldsymbol{x}-\boldsymbol{f}(\boldsymbol{\chi}(a_{1},\dots,a_{d}))-(t_{1}-a_{1})s\boldsymbol{v}_{1}-\dots-(t_{d}-a_{d})s\boldsymbol{v}_{d}}\right\|_{\infty}\\ <(2d^{2}+1)\sqrt{n\log n}\cdot r,

\mathopen{}\mathclose{{}\left\|\boldsymbol{x}-\boldsymbol{f}(\boldsymbol{\chi}(a_{1},\dots,a_{d}))-(t_{1}-a_{1})s\boldsymbol{v}_{1}-\dots-(t_{d}-a_{d})s\boldsymbol{v}_{d}}\right\|_{\infty}\\ <(2d^{2}+1)\sqrt{n\log n}\cdot r,

\mathopen{}\mathclose{{}\left\|t_{1}\boldsymbol{v}_{1}+\dots+t_{d}\boldsymbol{v}_{d}-a_{1}\boldsymbol{v}_{1}-\dots-a_{d}\boldsymbol{v}_{d}+\frac{1}{s}\boldsymbol{f}(\boldsymbol{\chi}(a_{1},\dots,a_{d})))-\frac{1}{s}\boldsymbol{x}}\right\|_{\infty}\\ <(2d^{2}+1)\cdot\frac{\sqrt{n\log n}\cdot r}{s}.

\mathopen{}\mathclose{{}\left\|t_{1}\boldsymbol{v}_{1}+\dots+t_{d}\boldsymbol{v}_{d}-a_{1}\boldsymbol{v}_{1}-\dots-a_{d}\boldsymbol{v}_{d}+\frac{1}{s}\boldsymbol{f}(\boldsymbol{\chi}(a_{1},\dots,a_{d})))-\frac{1}{s}\boldsymbol{x}}\right\|_{\infty}\\ <(2d^{2}+1)\cdot\frac{\sqrt{n\log n}\cdot r}{s}.

Y\leq c^{\prime}\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{\sqrt{n\log n}\cdot r}{s}}\right)^{d},

Y\leq c^{\prime}\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{\sqrt{n\log n}\cdot r}{s}}\right)^{d},

\mathbb{E}Y\leq n^{-d}\cdot(2n)^{d}+c^{\prime}\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{\sqrt{n\log n}\cdot r}{s}}\right)^{d}\leq(c^{\prime}+1)\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{\sqrt{n\log n}\cdot r}{s}}\right)^{d},

\mathbb{E}Y\leq n^{-d}\cdot(2n)^{d}+c^{\prime}\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{\sqrt{n\log n}\cdot r}{s}}\right)^{d}\leq(c^{\prime}+1)\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{\sqrt{n\log n}\cdot r}{s}}\right)^{d},

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\leq n^{-d}+(c_{p}/\sqrt{\varepsilon})^{d}\cdot n^{-d/2}\cdot(c^{\prime}+1)\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{\sqrt{n\log n}\cdot r}{s}}\right)^{d}\\ =n^{-d}+(c_{p}/\sqrt{\varepsilon})^{d}\cdot(c^{\prime}+1)\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{r\sqrt{\log n}}{s}}\right)^{d}\leq((c_{p}/\sqrt{\varepsilon})^{d}\cdot(c^{\prime}+1)\cdot(2d^{2}+1)^{d}+1)\cdot\mathopen{}\mathclose{{}\left(\frac{r\sqrt{\log n}}{s}}\right)^{d},

\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\xi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}}\right)\leq n^{-d}+(c_{p}/\sqrt{\varepsilon})^{d}\cdot n^{-d/2}\cdot(c^{\prime}+1)\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{\sqrt{n\log n}\cdot r}{s}}\right)^{d}\\ =n^{-d}+(c_{p}/\sqrt{\varepsilon})^{d}\cdot(c^{\prime}+1)\cdot(2d^{2}+1)^{d}\cdot\mathopen{}\mathclose{{}\left(\frac{r\sqrt{\log n}}{s}}\right)^{d}\leq((c_{p}/\sqrt{\varepsilon})^{d}\cdot(c^{\prime}+1)\cdot(2d^{2}+1)^{d}+1)\cdot\mathopen{}\mathclose{{}\left(\frac{r\sqrt{\log n}}{s}}\right)^{d},

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|S_{I}}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|I}\right|}|R|}\right|\leq K\cdot|R|^{1/2}\log|R|.

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|S_{I}}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|I}\right|}|R|}\right|\leq K\cdot|R|^{1/2}\log|R|.

\Pr\mathopen{}\mathclose{{}\left(\vphantom{\sum}\psi_{H}(\mathcal{G},G_{0},\cdot)=\lambda}\right)\leq n^{(g-h+1)\cdot T+o(1)}.

\Pr\mathopen{}\mathclose{{}\left(\vphantom{\sum}\psi_{H}(\mathcal{G},G_{0},\cdot)=\lambda}\right)\leq n^{(g-h+1)\cdot T+o(1)}.

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|\bigcap_{i\in I}S_{i}}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|}|R|}\right|\leq 2^{m}\cdot K\cdot|R|^{1/2}\log|R|.

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|\bigcap_{i\in I}S_{i}}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|}|R|}\right|\leq 2^{m}\cdot K\cdot|R|^{1/2}\log|R|.

\mathopen{}\mathclose{{}\left|\bigcap_{i\in I}S_{i}}\right|=\sum_{J}\mathopen{}\mathclose{{}\left|S_{J}}\right|.

\mathopen{}\mathclose{{}\left|\bigcap_{i\in I}S_{i}}\right|=\sum_{J}\mathopen{}\mathclose{{}\left|S_{J}}\right|.

p^{\mathopen{}\mathclose{{}\left|I}\right|}|R|=p^{\mathopen{}\mathclose{{}\left|I}\right|}\mathopen{}\mathclose{{}\left(p+(1-p)}\right)^{m-\mathopen{}\mathclose{{}\left|I}\right|}|R|=\sum_{J}p^{\mathopen{}\mathclose{{}\left|J}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|J}\right|}|R|,

p^{\mathopen{}\mathclose{{}\left|I}\right|}|R|=p^{\mathopen{}\mathclose{{}\left|I}\right|}\mathopen{}\mathclose{{}\left(p+(1-p)}\right)^{m-\mathopen{}\mathclose{{}\left|I}\right|}|R|=\sum_{J}p^{\mathopen{}\mathclose{{}\left|J}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|J}\right|}|R|,

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|\bigcap_{i\in I}S_{i}}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|}|R|}\right|=\mathopen{}\mathclose{{}\left|\sum_{J}\mathopen{}\mathclose{{}\left|S_{J}}\right|-\sum_{J}p^{\mathopen{}\mathclose{{}\left|J}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|J}\right|}|R|}\right|\leq\sum_{J}\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|S_{J}}\right|-p^{\mathopen{}\mathclose{{}\left|J}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|J}\right|}|R|}\right|\\ \leq\sum_{J}K\cdot|R|^{1/2}\log|R|\leq 2^{m}\cdot K\cdot|R|^{1/2}\log|R|,

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|\bigcap_{i\in I}S_{i}}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|}|R|}\right|=\mathopen{}\mathclose{{}\left|\sum_{J}\mathopen{}\mathclose{{}\left|S_{J}}\right|-\sum_{J}p^{\mathopen{}\mathclose{{}\left|J}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|J}\right|}|R|}\right|\leq\sum_{J}\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|S_{J}}\right|-p^{\mathopen{}\mathclose{{}\left|J}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|J}\right|}|R|}\right|\\ \leq\sum_{J}K\cdot|R|^{1/2}\log|R|\leq 2^{m}\cdot K\cdot|R|^{1/2}\log|R|,

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|S_{I}\cap S_{m+1}}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|+1}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|I}\right|}n}\right|\leq K\cdot n^{1/2}\log n.

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|S_{I}\cap S_{m+1}}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|+1}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|I}\right|}n}\right|\leq K\cdot n^{1/2}\log n.

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|S_{I}\cap(R\!\setminus\!S_{m+1})}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|I}\right|+1}n}\right|\leq K\cdot n^{1/2}\log n.

\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left|S_{I}\cap(R\!\setminus\!S_{m+1})}\right|-p^{\mathopen{}\mathclose{{}\left|I}\right|}\mathopen{}\mathclose{{}\left(1-p}\right)^{m-\mathopen{}\mathclose{{}\left|I}\right|+1}n}\right|\leq K\cdot n^{1/2}\log n.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Anticoncentration for subgraph counts in random graphs

Jacob Fox Department of Mathematics, Stanford University, Stanford, CA 94305. Email: [email protected]. Research supported by a Packard Fellowship and by NSF Career Award DMS-1352121.

Matthew Kwan Department of Mathematics, Stanford University, Stanford, CA 94305. Email: [email protected]. Research supported in part by SNSF project 178493.

Lisa Sauermann School of Mathematics, Institute for Advanced Study, Princeton, NJ 08540. Email: [email protected].

Abstract

Fix a graph $H$ and some $p\in(0,1)$ , and let $X_{H}$ be the number of copies of $H$ in a random graph $\mathbb{G}(n,p)$ . Random variables of this form have been intensively studied since the foundational work of Erdős and Rényi. There has been a great deal of progress over the years on the large-scale behaviour of $X_{H}$ , but the more challenging problem of understanding the small-ball probabilities has remained poorly understood until now. More precisely, how likely can it be that $X_{H}$ falls in some small interval or is equal to some particular value? In this paper we prove the almost-optimal result that if $H$ is connected then for any $x\in\mathbb{N}$ we have $\Pr(X_{H}=x)\leq n^{1-v(H)+o(1)}$ . Our proof proceeds by iteratively breaking $X_{H}$ into different components which fluctuate at “different scales”, and relies on a new anticoncentration inequality for random vectors that behave “almost linearly”.

1 Introduction

Let $\mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right)$ be the binomial random graph model, where we fix a set of $n$ vertices and include each of the $\binom{n}{2}$ possible edges independently with probability $p$ . For graphs $H$ and $G$ , let $X_{H}\mathopen{}\mathclose{{}\left(G}\right)$ be the number of subgraphs of $G$ isomorphic to $H$ , so that if $G\sim\mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right)$ then we can interpret $X_{H}$ as the random variable that counts the number of copies of $H$ in a random graph.

These subgraph-counting random variables and their distributions are central objects of study in the theory of random graphs, going back to the foundational work of Erdős and Rényi [17]. Early work [9, 17, 35] concerned* *existence of subgraphs: fixing $H$ , for which $n$ and $p$ is it likely that $X_{H}>0$ , and for which $n$ and $p$ is it likely that $X_{H}=0$ ? It turns out that there is a threshold value of $p$ (as a decaying function of $n$ ) that cleanly separates these two behaviours, and further work [9, 10, 17, 25, 35, 37] focused on investigating the (Poisson-type) distribution of $X_{H}$ near this threshold.

In this paper, we are more interested in the behaviour far above this existence threshold (when $p$ is a constant independent of $n$ ). When appropriately normalised, $X_{H}$ has an asymptotically111All asymptotics, here and for the rest of the paper, are as $n\to\infty$ , while $p$ is fixed. normal distribution (this was proved by Nowicki and Wierman [32] and Ruciński [34], following several results [3, 24, 25] pushing increasingly further past the existence threshold). Further work by Barbour, Karoński and Ruciński [4] provided quantitative bounds on the rate of convergence to the normal distribution. However, this asymptotic normality only characterises the “large-scale” behaviour of the distribution of $X_{H}$ , and is basically due to the fact that $X_{H}$ closely correlates with the number of edges in $\mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right)$ .

A more challenging direction of research is to understand “local” aspects of the distributions of these subgraph-counting random variables222We remark that another completely different challenging direction of research is the study of large deviations of subgraph counts in random graphs. Recently, there has been a lot of progress in this area, see for example the monograph [11], the recent papers [2, 5, 8, 13, 22], and the references therein.. In the past decade, there have been a number of advances in this direction. Following work by Loebl, Matoušek and Pangrác [27] for the case where $H$ is a triangle, it was proved by Kolaitis and Kopparty [26] (see also [14]) that if we fix some $p\in\mathopen{}\mathclose{{}\left(0,1}\right)$ , some prime $q\in\mathbb{N}$ and some connected graph $H$ with at least one edge, then $X_{H}$ mod $q$ has an asymptotically uniform distribution on $\mathopen{}\mathclose{{}\left\{0,\dots,q-1}\right\}$ . More recently, local central limit theorems have begun to emerge, giving asymptotic formulas for the point probabilities $\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)$ in terms of a normal density function. Such a theorem was first proved for the case where $H$ is a triangle by Gilmer and Kopparty [20] (see also [6]), and this was extended by Berkowitz [7] to the case where $H$ is any clique.

In this paper we are concerned with a somewhat looser question: what can be said about the anticoncentration behaviour of $X_{H}$ ? Roughly speaking, this is asking for uniform upper bounds on the point probabilities $\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)$ , or more generally on the small ball probabilities $\Pr\mathopen{}\mathclose{{}\left(X_{H}\in I}\right)$ , where $I$ is an interval of prescribed length. Meka, Nguyen and Vu [30] developed some general polynomial anticoncentration inequalities, and used the polynomial structure of $X_{H}$ to prove the bound $\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq n^{-1+o\mathopen{}\mathclose{{}\left(1}\right)}$ for constant $p\in\mathopen{}\mathclose{{}\left(0,1}\right)$ and any $H$ that contains at least one edge. In [19] we proposed the following conjecture.

Conjecture 1.1.

Fix $p\in\mathopen{}\mathclose{{}\left(0,1}\right)$ and fix a graph $H$ with no isolated vertices. Then

[TABLE]

where $v(H)$ is the number of vertices of $H$ .

The motivation for Conjecture 1.1 is that if $p$ is fixed then $\operatorname{Var}X_{H}=\Theta\mathopen{}\mathclose{{}\left(n^{2v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right)$ (see for example [23, Lemma 3.5]), and the aforementioned asymptotic normality therefore implies that $X_{H}$ is concentrated on an interval of length $\Theta\mathopen{}\mathclose{{}\left(n^{v\mathopen{}\mathclose{{}\left(H}\right)-1}}\right)$ . Provided that the distribution of $X_{H}$ is sufficiently “smooth”, we should expect each value in this interval to have comparable probability. Note that this line of reasoning implies that Conjecture 1.1, if true, is best possible: any stronger bound would contradict Chebyshev’s inequality. Also, observe that the assumption that $H$ has no isolated vertices is necessary: if $H^{\prime}$ is obtained from $H$ by removing isolated vertices then $X_{H}$ is a deterministic multiple of $X_{H^{\prime}}$ , so inherits its point probabilities.

We also remark that while Meka, Nguyen and Vu were the first to explicitly consider anticoncentration of subgraph counts in general, actually Pangrác, Matoušek and Loebl [27] considered anticoncentration of the triangle-count $X_{K_{3}}$ more than ten years earlier: a primary motivation for their aforementioned work on triangle-counts mod $q$ was to show that the point probabilities $\Pr\mathopen{}\mathclose{{}\left(X_{K_{3}}=x}\right)$ are small (they gave a bound of $O\mathopen{}\mathclose{{}\left(1/\log n}\right)$ ), which in turn implies that two independent copies of $\mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right)$ are unlikely to have the same Tutte polynomial. In addition, many of the other aforementioned results concerning the distribution of $X_{H}$ imply anticoncentration bounds: any central limit theorem already implies that $\max_{x}\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)=o\mathopen{}\mathclose{{}\left(1}\right)$ , and one can333The central limit theorem of Barbour, Karoński and Ruciński was not stated in a way that allows one to directly read off estimates for probabilities regarding $X_{H}$ . But, it is possible to deduce such an estimate with the method of [33, Proposition 1.2.2]. deduce from the quantitative central limit theorem of Barbour, Karoński and Ruciński [4] that $\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|X-x}\right|\leq n^{v\mathopen{}\mathclose{{}\left(H}\right)-3/2}}\right)=O\mathopen{}\mathclose{{}\left(1/\sqrt{n}}\right)$ . The local central limit theorem of Berkowitz [7] definitively settles the matter in the case where $H=K_{h}$ is an $h$ -vertex clique, in which case it actually gives the asymptotically optimal bound $\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq\mathopen{}\mathclose{{}\left(2\pi\operatorname{Var}X_{H}}\right)^{-1/2}+o\mathopen{}\mathclose{{}\left(n^{1-h}}\right)=O(n^{1-h})$ .

In [19] we used ideas related to Erdős’ combinatorial proof of the Erdős–Littlewood–Offord theorem (see [18]) to give a simple proof of the general bound $\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|X_{H}-x}\right|\leq n^{v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right)=O\mathopen{}\mathclose{{}\left(1/n}\right)$ , and we showed how to extend these methods to prove the sharper bound $\Pr\mathopen{}\mathclose{{}\left(X_{K_{h}}=x}\right)=n^{1-h+o\mathopen{}\mathclose{{}\left(1}\right)}$ in the case where $H=K_{h}$ is a clique. In this paper we develop these methods much further, proving an approximate version of Conjecture 1.1 for all connected $H$ .

Theorem 1.2.

Fix $p\in\mathopen{}\mathclose{{}\left(0,1}\right)$ and fix a connected graph $H$ . Then

[TABLE]

Concerning the $o(1)$ -term in Theorem 1.2, the arguments in our proof yield a bound for the $o(1)$ -term that decays extremely slowly as $n$ goes to infinity. In order to simplify the presentation of the proof and to avoid additional technical details, we decided to write the proof in a way that does not give any explicit bounds for the $o(1)$ -term.

The general idea for the proof of Theorem 1.2 is to break up $X_{H}$ into different components that fluctuate at “different scales” and handle each component separately. In the case where $H$ is a clique, this plan is relatively simple to execute, but in the more general setting of Theorem 1.2 there are a number of additional challenges that must be overcome. We discuss these in Section 2. Our proof has a number of new ingredients; one that is perhaps worth highlighting is a combinatorial anticoncentration inequality for vector-valued random variables that behave “almost linearly”, in the spirit of some anticoncentration theorems due to Halász. The details are in Section 3.

1.1 Basic definitions and notation

We use standard graph-theoretic notation throughout. In particular, the vertex and edge sets of a graph $G$ are denoted by $V(G)$ and $E(G)$ , and the sizes of these sets are denoted by $e(G)$ and $v(G)$ . For disjoint vertex sets $A,B$ in a graph $G$ , we write $e_{G}(A)$ for the number of edges $e(G[A])$ inside $A$ , and we write $e_{G}(A,B)$ for the number of edges between $A$ and $B$ . Abusing notation, for a vertex $v$ we write $e_{G}(v,A)$ to mean $e_{G}(\{v\},A)$ , which is the size of the $A$ -neighbourhood of $v$ in $G$ . We write $N(v)$ for the neighbourhood of $v$ (that is, the set of vertices adjacent to $v$ ). A homomorphism $\phi$ from a multigraph $H$ to a multigraph $G$ consists of a map from the vertices of $H$ to the vertices of $G$ , and a map from the edges of $H$ to the edges of $G$ , such that whenever $e$ is an edge of $H$ between vertices $x$ and $y$ , the image $\phi(e)$ is an edge between the vertices $\phi(x)$ and $\phi(y)$ .

We initially introduced $X_{H}$ as the random variable that counts unlabelled copies of $H$ (as is standard in this area), but for the proof it will be slightly more convenient to redefine $X_{H}$ to count the number of labelled copies of $H$ (injective homomorphisms from $H$ into $G$ ). The labelled/unlabelled distinction is irrelevant for Theorem 1.2, because these two counts differ by a fixed multiplicative factor (the number of automorphisms of $H$ ).

We use standard asymptotic notation throughout. For functions $f=f\mathopen{}\mathclose{{}\left(n}\right)$ and $g=g\mathopen{}\mathclose{{}\left(n}\right)$ we write $f=O\mathopen{}\mathclose{{}\left(g}\right)$ to mean that there is a constant $C$ such that $\mathopen{}\mathclose{{}\left|f}\right|\leq C\mathopen{}\mathclose{{}\left|g}\right|$ , we write $f=\Omega\mathopen{}\mathclose{{}\left(g}\right)$ to mean there is a constant $c>0$ such that $f\geq c\mathopen{}\mathclose{{}\left|g}\right|$ for sufficiently large $n$ , we write $f=\Theta\mathopen{}\mathclose{{}\left(g}\right)$ to mean that $f=O\mathopen{}\mathclose{{}\left(g}\right)$ and $f=\Omega\mathopen{}\mathclose{{}\left(g}\right)$ , and we write $f=o\mathopen{}\mathclose{{}\left(g}\right)$ or $g=\omega\mathopen{}\mathclose{{}\left(f}\right)$ to mean that $f/g\to 0$ as $n\to\infty$ . Unless stated otherwise, all asymptotics are as $n\to\infty$ (all other variables should be viewed as constant).

We will use notation of the form $\mathbb{E}_{G}$ to indicate an expected value with respect to a random choice of $G$ (if there are other sources of randomness, then formally this is a conditional expected value). We write $\operatorname{Ber}(p)$ for the $p$ -Bernoulli distribution, meaning that if $\xi\sim\operatorname{Ber}(p)$ then $\Pr(\xi=1)=p$ and $\Pr(\xi=0)=1-p$ . Finally, we write $\mathbb{N}$ for the set of non-negative integers, we write $[n]$ for the set $\{1,\dots,n\}$ , and all logarithms are to base $e$ .

2 Discussion and main ideas of the proof

Before discussing the new ideas in the proof of Theorem 1.2, it is worth reviewing the proofs in [19] giving weaker anticoncentration bounds. We will build on these ideas to prove Theorem 1.2.

First, to prove the bound $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|X_{H}-x}\right|\leq n^{v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right)=O\mathopen{}\mathclose{{}\left(1/n}\right)$ , the key observation was that $X_{H}$ is an “almost linear” function of the edges of the underlying random graph $G\sim\mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right)$ . Specifically, for a pair of vertices $e=\mathopen{}\mathclose{{}\left\{x,y}\right\}$ , the difference $\Delta X_{H}:=X_{H}\mathopen{}\mathclose{{}\left(G+e}\right)-X_{H}\mathopen{}\mathclose{{}\left(G-e}\right)$ is tightly concentrated around its expectation $\mathbb{E}\Delta X_{H}=\Theta\mathopen{}\mathclose{{}\left(n^{v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right)$ , meaning that adding or removing an edge typically causes $X_{H}$ to increase or decrease by about this amount444This observation is closely related to the fact that the number of edges in $G$ is closely correlated with $X_{H}$ . This fact can be used to prove a central limit theorem for $X_{H}$ (see for example [23, Example 6.4]).. Using this observation, and some ideas from Erdős’ proof of the Erdős–Littlewood–Offord theorem [18] and Lubell’s proof of the LYM inequality [28], it is possible to prove that the anticoncentration behaviour of $X_{H}/\mathbb{E}\Delta X_{H}$ is about the same as the anticoncentration behaviour of the number of edges of $G$ , which is a binomial random variable with parameters $\binom{n}{2}$ and $p$ . This gives the desired bound, which in some sense gives “coarse scale” anticoncentration for $X_{H}$ .

Second, to prove the bound $\Pr\mathopen{}\mathclose{{}\left(X_{K_{h}}=x}\right)\leq n^{1-h+o\mathopen{}\mathclose{{}\left(1}\right)}$ for cliques, the key idea was to fix a vertex $v$ and write $X_{K_{h}}\mathopen{}\mathclose{{}\left(G}\right)=X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)+X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ . That is, the number of $h$ -cliques in $G$ is the same as the number of $h$ -cliques in $G-v$ , plus the number of $\mathopen{}\mathclose{{}\left(h-1}\right)$ -cliques in the neighbourhood of $v$ (which yield an $h$ -clique when combined with $v$ ). Now, it is possible to use similar ideas as in the preceding paragraph to show that $X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)$ is anticoncentrated at a coarse scale. On the other hand, $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ has a much smaller order of magnitude, and it is actually concentrated on a relatively small interval around its expectation. Furthermore, we can bound the point probabilities for $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ inductively.

Then, roughly speaking, the idea was as follows: We want to bound the probability of having $X_{K_{h}}\mathopen{}\mathclose{{}\left(G}\right)=X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)+X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)=x$ . Since $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ is concentrated on a small interval around its expectation $\mathbb{E}X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ , in order to have $X_{K_{h}}\mathopen{}\mathclose{{}\left(G}\right)=x$ the value of $X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)$ must (typically) be reasonably close to $x-\mathbb{E}X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ . Using the coarse scale anticoncentration bounds for $X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)$ , we can bound the probability that this happens. After knowing the value $X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)$ , we know what value $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ needs to take to have $X_{K_{h}}\mathopen{}\mathclose{{}\left(G}\right)=x$ . By induction, we can bound the probability that $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ takes this particular value.

If $X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)$ and $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ were independent, it would be easy to conclude the desired anticoncentration bound $\Pr\mathopen{}\mathclose{{}\left(X_{K_{h}}=x}\right)\leq n^{1-h+o\mathopen{}\mathclose{{}\left(1}\right)}$ ; we would be able to simply multiply the above two probability estimates. Unfortunately, these random variables are not independent, so we need to rule out the possibility that the fluctuations in $X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)$ “cancel out” the fluctuations in $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ in a way that causes $X_{K_{h}}$ to concentrate on a particular value. The approach we took was to show that actually $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ is anticoncentrated even after conditioning on a typical outcome of $G-v$ (after which the only remaining randomness comes from the set of neighbours $N\mathopen{}\mathclose{{}\left(v}\right)$ ).

We proved a conditional anticoncentration bound of this type by induction, using a moment argument. To be more specific, we viewed the conditional probabilities $\Pr\mathopen{}\mathclose{{}\left(X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)=z\,\middle|\,G-v}\right)$ as random variables depending on $G-v$ . To study these random variables we studied their high moments, which essentially comes down to considering collections of different candidates for the neighborhood $N(v)$ , and bounding the probability that for each of these candidates we simultaneously have $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)=z$ . We accomplished this with a multiple exposure argument: we iteratively went through our candidate sets for $N(v)$ and exposed the edges of $G-v$ inside each set which had not yet been exposed before. Using a suitable induction hypothesis, and the ideas sketched earlier, at each step we can bound the probability that the corresponding candidate set for $N(v)$ gives rise to the desired value of $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ . This way we obtained a suitable bound for the moment argument.

Now, there are several obstacles that need to be overcome to generalise the above argument beyond the case where $H$ is a clique. First, the decomposition $X_{K_{h}}=X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)+X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ was very convenient for us: it allowed us to consider two separate random variables, one of which can be studied on a “coarse” scale using our Littlewood–Offord type techniques, and the other of which can be studied inductively. Actually, it is not a huge problem to generalise this decomposition. In general, we have $X_{H}=X_{H}\mathopen{}\mathclose{{}\left(G-v}\right)+X_{H}^{v}$ , where $X_{H}^{v}$ is the number of copies of $H$ in $G$ which contain $v$ . One can check that $X_{H}^{v}$ is a certain sum of weighted subgraph counts in $G-v$ , where each of the subgraphs has $v\mathopen{}\mathclose{{}\left(H}\right)-1$ vertices, and the weight of a subgraph depends on its intersection with the neighbourhood $N\mathopen{}\mathclose{{}\left(v}\right)$ of $v$ . So, if we generalise the induction hypothesis to certain weighted sums of subgraph counts, it is still possible to control the anticoncentration of $X_{H}^{v}$ inductively.

The second main obstacle, which is more serious, concerns the multiple-exposure argument we used to show that $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ is anticoncentrated even after conditioning on a typical outcome of $G-v$ . This crucially depended on the fact that in order to know the value of $X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)$ given a particular candidate for $N\mathopen{}\mathclose{{}\left(v}\right)$ , the only edges of $G-v$ that we need to expose are the edges inside $N\mathopen{}\mathclose{{}\left(v}\right)$ (leaving the remaining edges for future rounds of exposure). Unfortunately, in general one may need to examine all the edges of $G-v$ to determine the value of $X_{H}^{v}$ , even if we fix a candidate for $N\mathopen{}\mathclose{{}\left(v}\right)$ . Specifically, this is the case whenever $H$ is not a complete multipartite graph (if $H$ is complete multipartite, then we do not need to expose the edges which lie completely outside $N\mathopen{}\mathclose{{}\left(v}\right)$ , and actually in this special case it is not too hard to extend the proof in [19] to prove Conjecture 1.1).

In the general case where $H$ is not a complete multipartite graph, in order to use a moment argument as above, we need some other way to estimate the joint probability that many candidates for $N\mathopen{}\mathclose{{}\left(v}\right)$ each result in a specific outcome of $X_{H}^{v}$ . Write $X_{H}^{v}\mathopen{}\mathclose{{}\left(N\mathopen{}\mathclose{{}\left(v}\right),G-v}\right)$ to indicate the dependence of $X_{H}^{v}$ on both $N\mathopen{}\mathclose{{}\left(v}\right)$ and $G-v$ . For a collection of sets $A_{1},\dots,A_{t}$ as candidates for $N\mathopen{}\mathclose{{}\left(v}\right)$ , we want to control the joint probability that all of $X_{H}^{v}\mathopen{}\mathclose{{}\left(A_{1},G-v}\right),\dots,X_{H}^{v}\mathopen{}\mathclose{{}\left(A_{t},G-v}\right)$ are equal to a given value. Since we cannot consider these random variables separately anymore (as we did before with the multiple exposure argument), we need to somehow modify the induction hypothesis to handle this joint probability.

Specifically, we can generalise to a statement about joint anticoncentration probabilities of random variables of the following type (thus strengthening the induction hypothesis). Take a sequence of distinct vertices $v_{1},\dots,v_{g}$ and for each $v_{i}$ , consider some collection of $t_{i}$ different candidates for the neighbourhood of $v_{i}$ . Then, consider the $T=t_{1}\dotsm t_{g}$ different random variables obtained by making different choices for the neighbourhoods for each of $v_{1},\dots,v_{g}$ , and considering the number of copies of $H$ which contain all of $v_{1},\dots,v_{g}$ , conditioned on $v_{1},\dots,v_{g}$ having these neighbourhoods. The idea is that at each step of the induction we introduce a new vertex $v_{j}$ , and we consider many candidates for the neighbourhood of $v_{j}$ for a moment argument. Given that we are considering joint probabilities of $T=t_{1}\dotsm t_{g}$ random variables, our induction hypothesis needs to give a bound of the form $n^{(g-h+1)T+o(1)}$ on the joint probability that all our random variables are equal to particular values. This can be viewed as an anticoncentration bound for a random vector $\boldsymbol{X}$ .

To make the above ideas work, we need multivariate generalisations of some of the ideas described so far. For example, we need an anticoncentration inequality for random vectors that are almost-linear in the sense sketched earlier, having the property that adding or removing an edge typically causes a predictable change in their values. There are some classical anticoncentration inequalities by Halász [21] that give the kind of bounds we need, for random vectors that depend linearly (and “non-degenerately”) on a sequence of independent random choices. (Some kind of non-degeneracy assumption is necessary, to rule out situations where the random vector is always contained in a proper subspace of smaller dimension). The standard proofs of Halász’ inequalities are Fourier-analytic, and are not robust enough to apply to our almost-linear setting, but we were able to find some combinatorial arguments that apply to our setting, again inspired by proofs of Erdős [18] and Lubell [28]. More details are in Section 3.

In order to use the estimates in Section 3, we need to check a non-degeneracy condition: basically, we need to consider the effects of changing the status of various edges, and we need to show that the corresponding changes to $\boldsymbol{X}$ are in “many different directions”, spanning $\mathbb{R}^{T}$ . We also need to check a similar non-degeneracy condition for the effects of adding or removing vertices from the various candidate neighbourhoods. Unfortunately, these non-degeneracy conditions do not hold for an arbitrary connected graph $H$ (they do hold, however, if $H$ has a vertex with edges to all other vertices). Therefore, we actually need to further modify our approach.

Instead of considering a single vertex $v_{j}$ in each step of the induction, we will consider $a_{j}$ different vertices, having a diverse range of adjacencies to the vertices previously chosen. Then, our decomposition is that we split the copies of $H$ into the copies that contain none of our $a_{j}$ identified vertices, and the copies that contain at least one of them. With this modification, there is a much richer range of possibilities for the effect of changing the status of an edge, and this allows us to prove the desired non-degeneracy condition. Unfortunately, while this modification is conceptually rather simple, it complicates notation enormously. We now need to maintain a collection of sets of vertices, and a collection of possibilities for the neighbourhoods of these vertices. To encode all of these data we introduce the notion of a colour system: each step of the induction is associated with a different colour, and at each step there are multiple “shades” of each colour indicating the different possibilities for the neighbourhoods of the various vertices introduced at that step. We can then state our induction hypothesis for a class of random variables defined in terms of colour systems, and prove it using the ideas we discussed in this outline.

3 Anticoncentration for “almost-linear” random vectors

The Erdős–Littlewood–Offord theorem states that if $\xi_{1},\dots,\xi_{n}$ are independent Bernoulli random variables satisfying $\Pr(\xi_{i}=0)=\Pr(\xi_{i}=1)=1/2$ , and $X=a_{1}\xi_{1}+\dots+a_{n}\xi_{n}$ is a linear combination of these random variables (where each coefficient $a_{i}$ has absolute value at least one) then for any $x\in\mathbb{R}$ we have $\Pr(|X-x|\leq 1)=O(1/\sqrt{n})$ . As outlined in Section 2, in [19, Theorem 1.2] we adapted Erdős’ proof of this theorem to handle the case of “almost-linear” functions of $\xi_{1},\dots,\xi_{n}$ .

In [21], among other results, Halász gave a multivariate generalisation of the Erdős–Littlewood–Offord theorem, for sums of random vectors satisfying a certain non-degeneracy condition. Specifically, suppose that $\boldsymbol{a}_{1},\dots,\boldsymbol{a}_{n}\in\mathbb{R}^{d}$ are $d$ -dimensional vectors with the property that for every unit vector $\boldsymbol{e}\in\mathbb{R}^{d}$ , there are $\Omega\mathopen{}\mathclose{{}\left(n}\right)$ indices $i$ with $\mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left\langle\boldsymbol{a}_{i},\boldsymbol{e}}\right\rangle}\right|\geq 1$ . Halász proved that, with $\boldsymbol{X}=\xi_{1}\boldsymbol{a}_{1}+\dots+\xi_{n}\boldsymbol{a}_{n}$ , we have $\max_{\boldsymbol{x}\in\mathbb{R}^{d}}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{X}-\boldsymbol{x}}\right\|_{2}\leq 1}\right)=O\mathopen{}\mathclose{{}\left(n^{-d/2}}\right)$ . As outlined in Section 2, for the proof of Theorem 1.2 we will need a similar bound for almost-linear $\boldsymbol{X}$ . Our non-degeneracy condition will be somewhat cruder than Halász’; we assume that there are vectors $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{m}\in\mathbb{R}^{d}$ , spanning $\mathbb{R}^{d}$ , such that each of these vectors is represented $\Omega\mathopen{}\mathclose{{}\left(n}\right)$ times as the direction of the “typical effect” of changing the status of some $\xi_{i}$ .

Theorem 3.1.

Fix real numbers $0<p<1$ and $0<\varepsilon<1$ , integers $m\geq d\geq 1$ , and vectors $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{m}\in\mathbb{R}^{d}$ spanning $\mathbb{R}^{d}$ . Then there is a constant $c>0$ , depending on $p$ , $\varepsilon$ , $d$ and $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{m}$ , such that the following holds. For any positive integer $n$ and any function $\boldsymbol{f}:\mathopen{}\mathclose{{}\left\{0,1}\right\}^{n}\to\mathbb{R}^{d}$ , let $\boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n}$ and for $i=1,\dots,n$ define the random variables

[TABLE]

Suppose that for some positive real numbers $r$ and $s$ with $r\sqrt{n\log n}\geq s$ there are disjoint subsets $I_{1},\dots,I_{m}\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,n}\right\}$ of size at least $\varepsilon n$ such that for each $i\in I_{j}$ we have $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-s\boldsymbol{v}_{j}}\right\|_{\infty}\geq r}\right)\leq n^{-6d^{2}}$ . Then for any $\boldsymbol{x}\in\mathbb{R}^{d}$ we have

[TABLE]

Roughly speaking, the assumption on the function $\boldsymbol{f}$ in Theorem 3.1 means the following: For each of the vectors $\boldsymbol{v}_{j}$ (with $1\leq j\leq m$ ) there is a reasonably large subset $I_{j}\subseteq\{1,\dots,n\}$ , such that for every $i\in I_{j}$ the following holds. When changing the $i$ -th coordinate of $\boldsymbol{\xi}$ from [math] to $1$ the corresponding vectors $\boldsymbol{f}(\boldsymbol{\xi})$ typically differ by roughly $s\boldsymbol{v}_{j}$ (more precisely, with high probability the difference of the corresponding vectors $\boldsymbol{f}(\boldsymbol{\xi})$ is close to the vector $s\boldsymbol{v}_{j}$ ). This condition can be seen as some sort of “almost-linearity” (at least with respect to certain coordinates of $\boldsymbol{\xi}$ ). Intuitively, this condition suggests that for a random vector $\boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n}$ , the vector $\boldsymbol{f}(\boldsymbol{\xi})$ must be reasonably spread out and not too concentrated close to any given $\boldsymbol{x}\in\mathbb{R}^{d}$ . Theorem 3.1 makes this intuition precise. The precise bound of $n^{-6d^{2}}$ for the probability $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-s\boldsymbol{v}_{j}}\right\|_{\infty}\geq r}\right)$ in the assumptions of Theorem 3.1 was chosen for convenience in the proof of the theorem. The exponent $6d^{2}$ is certainly not sharp, but the precise value of this exponent is not relevant for the remainder of this paper, so we made no effort to optimise the exponent at the cost of complicating the proof of Theorem 3.1.

As mentioned above, Theorem 3.1 can be considered to be an analogue of Halász’ classical anticoncentration inequality for linear vector-valued functions [21] (satisfying a certain non-degeneracy condition), in the weaker setting of “almost-linear” functions. However, our non-degeneracy condition is somewhat more restrictive than the one in Halász’ original result. We also remark that an inequality very similar to Halasz’ was also proved by Tao and Vu [39, Theorem 1.4], and that there is a large body of work proving similar results without a non-degeneracy condition (in which case the bounds are much weaker; see for example the survey in [31, Section 2]).

Before starting the proof of Theorem 3.1, we record the following basic fact about lattices.

Lemma 3.2.

Fix a basis $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{d}\in\mathbb{R}^{d}$ of $\mathbb{R}^{d}$ . There exists a constant $c^{\prime}$ , only depending on $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{d}$ , such that for any $\boldsymbol{x^{\prime}}\in\mathbb{R}^{d}$ and any real number $z\geq 1$ there are at most $c^{\prime}\cdot z^{d}$ different $d$ -tuples of integers $(t_{1},\dots,t_{d})\in\mathbb{Z}^{d}$ with $\mathopen{}\mathclose{{}\left\|(t_{1}\boldsymbol{v}_{1}+\dots+t_{d}\boldsymbol{v}_{d})-\boldsymbol{x^{\prime}}}\right\|_{\infty}<z$ .

We next prove Theorem 3.1, further developing the ideas in the proof of [19, Theorem 1.2]. As mentioned above, the assumption on $\boldsymbol{f}$ in Theorem 3.1 means that when changing the $i$ -th coordinate of $\boldsymbol{\xi}$ from zero to one for some $i\in I_{j}$ , the vector $\boldsymbol{f}(\boldsymbol{\xi})$ typically changes by roughly $s\boldsymbol{v}_{j}$ . This means that, if we successively change appropriately chosen zeros to ones in $\boldsymbol{\xi}$ , we can (with sufficiently high probability) control the changes of the vector $\boldsymbol{f}(\boldsymbol{\xi})$ , and show that $\boldsymbol{f}(\boldsymbol{\xi})$ cannot be too often close to any given vector $\boldsymbol{x}\in\mathbb{R}^{d}$ . Indeed, if we change the coordinates of $\boldsymbol{\xi}$ with indices in $I_{1}\cup\dots\cup I_{m}$ , then the vector $\boldsymbol{f}(\boldsymbol{\xi})$ will typically move roughly along a lattice spanned by the vectors $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{m}$ , and we can use Lemma 3.2 to show that there are not too many choices for $\xi$ where $\mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\chi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}$ .

Proof of Theorem 3.1.

First, by relabelling the given vectors $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{m}\in\mathbb{R}^{d}$ , we may assume that $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{d}$ form a basis of $\mathbb{R}^{d}$ . We ignore the other vectors $\boldsymbol{v}_{d+1},\dots,\boldsymbol{v}_{m}$ (in other words, we may assume that $m=d$ ).

Now, for $j=1,\dots,d$ , let $n_{j}=|I_{j}|$ , so $\varepsilon n\leq n_{j}\leq n$ . Let $\boldsymbol{\xi}^{j}=\mathopen{}\mathclose{{}\left(\xi_{i}}\right)_{i\in I_{j}}$ be the restriction of the random vector $\boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n}$ to the coordinates in $I_{j}$ . Observe that the number $|\boldsymbol{\xi}^{j}|$ of ones in $\boldsymbol{\xi}^{j}$ is binomially distributed with parameters $n_{j}$ and $p$ . Therefore, for any $0\leq t\leq n_{j}$ we have that

[TABLE]

for a constant $c_{p}>0$ only depending on $p$ .

Now, observe that $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right)$ is an increasing function in $t$ for $t\leq pn_{j}$ and a decreasing function for $t\geq pn_{j}$ . Thus, for each $j=1,\dots,d$ , there are integers $0\leq a_{j}\leq b_{j}\leq n_{j}$ such that for any integer $t$ we have

[TABLE]

That is to say, $a_{j}$ and $b_{j}$ are defined as the boundaries of the range of values that have probability at least $n^{-2d}$ of occurring as $\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|$ . We next bound the difference $b_{j}-a_{j}$ .

The Chernoff bound (see for example [1, Theorem A.1.4]) yields $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|<pn_{j}-d\sqrt{n\log n}}\right)\leq n^{-2d^{2}}\leq n^{-2d}$ and $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|>pn_{j}+d\sqrt{n\log n}}\right)\leq n^{-2d^{2}}\leq n^{-2d}$ . Thus, we must have

[TABLE]

and in particular $b_{j}-a_{j}\leq 2d\sqrt{n\log n}$ for each $j=1,\dots,d$ . By the choice of $a_{j}$ and $b_{j}$ we have

[TABLE]

for every $j=1,\dots,d$ .

Now, for each $j=1,\dots,d$ , let $\sigma_{j}:[n_{j}]\to I_{j}$ be a uniformly random bijection (independently chosen for each $j$ ). Also, independently sample $\chi_{i}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)$ for each $i\in[n]\!\setminus\!(I_{1}\cup\dots\cup I_{d})$ . For any integers $t_{1},\dots,t_{d}\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ , let the vector $\boldsymbol{\chi}(t_{1},\dots,t_{d})\in\{0,1\}^{n}$ be defined as follows. For $i\in[n]\!\setminus\!(I_{1}\cup\dots\cup I_{d})$ , we already chose $\chi_{i}$ , the $i$ -th entry of $\boldsymbol{\chi}(t_{1},\dots,t_{d})$ . If $i\in I_{j}$ , then set $\chi_{i}=1$ if and only if $i\in\sigma_{j}([t_{i}])$ . In other words, among the entries $\chi_{i}$ for $i\in I_{j}$ there are precisely $t_{j}$ ones and those are in positions $\sigma_{j}\mathopen{}\mathclose{{}\left(1}\right),\dots,\sigma_{j}\mathopen{}\mathclose{{}\left(t_{j}}\right)$ .

For any given $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ , the random vector $\boldsymbol{\chi}(t_{1},\dots,t_{d})$ depends on the choices of the bijections $\sigma_{j}$ for $j=1,\dots,d$ and the random entries $\chi_{i}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)$ for $i\in[n]\!\setminus\!(I_{1}\cup\dots\cup I_{d})$ . Very importantly, the distribution of $\boldsymbol{\chi}(t_{1},\dots,t_{d})$ is the same as the distribution of the random vector $\boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n}$ in the theorem statement conditioned on having $\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}$ for $j=1,\dots,d$ . In particular, fixing any $\boldsymbol{x}\in\mathbb{R}^{d}$ , we have

[TABLE]

Hence, using the independence of the random variables $|\boldsymbol{\xi}^{1}|,\dots,|\boldsymbol{\xi}^{d}|$ as well as Equation 3.1, we obtain

[TABLE]

for each $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ . On the other hand, Equation 3.3 implies

[TABLE]

(Note that to have $|I_{j}|\geq\varepsilon n>0$ for $j=1,\dots,d$ , we must have $d\leq n$ ). Thus, we obtain

[TABLE]

where the sum is taken over all $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ . In other words,

[TABLE]

where $Y$ is the random variable counting the number of $d$ -tuples $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ with $\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}$ . This random variable $Y$ depends on the random choices of $\sigma_{j}$ for $j=1,\dots,d$ and on $\chi_{i}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)$ for $i\in[n]\!\setminus\!(I_{1}\cup\dots\cup I_{d})$ .

Note that we always have $Y\leq(n_{1}+1)\dotsm(n_{d}+1)\leq(2n)^{d}$ . Furthermore, recall that the distribution of $\boldsymbol{\chi}(t_{1},\dots,t_{d})$ is the same as the distribution of $\boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n}$ conditioned on having $\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j}$ for $j=1,\dots,d$ . This implies that for any $1\leq j^{*}\leq d$ , any $i\in I_{j^{*}}$ and any $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ , we have

[TABLE]

where we used Equation 3.2 and the assumption in the theorem that $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\boldsymbol{\xi}}}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r}\right)\leq n^{-6d^{2}}$ . Thus, by a union bound, the probability that there exist $1\leq j^{*}\leq d$ , $i\in I_{j^{*}}$ and $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ with $\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r$ , is at most $n^{-2d-2}\cdot d\cdot n\cdot n^{d}\leq n^{-d}$ .

Now, fix $c^{\prime}>0$ , only depending on $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{d}\in\mathbb{R}^{d}$ , as in Lemma 3.2.

Claim 3.3.

If $\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\leq r$ for all $1\leq j^{*}\leq d$ , all $i\in I_{j^{*}}$ and all $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ , then we have $Y\leq c^{\prime}\cdot(2d^{2}+1)^{d}\cdot(r\sqrt{n\log n}/s)^{d}$

Proof.

Note that for any $1\leq j^{*}\leq d$ and any $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ with $t_{j^{*}}<b_{j^{*}}$ , the vectors $\boldsymbol{\chi}(t_{1},\dots,t_{j^{*}-1},t_{j^{*}}+1,t_{j^{*}+1},\dots,t_{d})$ and $\boldsymbol{\chi}(t_{1},\dots,t_{d})$ only differ in that the first of these vectors has a one in position $\sigma_{j^{*}}(t_{j^{*}}+1)$ , while the second has a zero in that position. Hence

[TABLE]

As $\sigma_{j^{*}}(t_{j^{*}}+1)\in I_{j^{*}}$ , under the assumptions of the claim this implies

[TABLE]

for all $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ and all $1\leq j^{*}\leq d$ with $t_{j^{*}}<b_{j^{*}}$ . Adding this up for different values of $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ and using the triangle inequality, this implies that

[TABLE]

for all $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ , where for the second inequality we used that $t_{j}-a_{j}\leq b_{j}-a_{j}\leq 2d\sqrt{n\log n}$ for each $1\leq j\leq d$ .

Recall that $Y$ is the number of (integer) $d$ -tuples $(t_{1},\dots,t_{d})\in[a_{1},b_{1}]\times\dots\times[a_{d},b_{d}]$ which satisfy $\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}$ . For each such $d$ -tuple we then have (by the triangle inequality)

[TABLE]

and therefore

[TABLE]

Thus by Lemma 3.2 applied with $\boldsymbol{x^{\prime}}=a_{1}\boldsymbol{v}_{1}+\dots+a_{d}\boldsymbol{v}_{d}-\frac{1}{s}\boldsymbol{f}(\boldsymbol{\chi}(a_{1},\dots,a_{d})))+\frac{1}{s}\boldsymbol{x}$ as well as $z=(2d^{2}+1)\sqrt{n\log n}\cdot r/s$ (note that $z\geq 1$ as $r\sqrt{n\log n}\geq s$ by the assumptions of the theorem), we obtain that

[TABLE]

as desired. ∎

Just before Claim 3.3, we proved that its assumptions are satisfied with probability at least $1-n^{-d}$ . Thus, we obtain

[TABLE]

using that $r\sqrt{n\log n}\geq s$ . Plugging this into Equation 3.4, we can conclude

[TABLE]

where in the last inequality we used that $r\sqrt{\log n}/s\geq n^{-1/2}\geq n^{-1}$ . This implies the statement of Theorem 3.1 with $c=(c_{p}/\sqrt{\varepsilon})^{d}\cdot(c^{\prime}+1)\cdot(2d^{2}+1)^{d}+1$ . ∎

4 Colour systems and the induction hypothesis

As outlined in Section 2, our proof of Theorem 1.2 proceeds via induction over a class of random variables generalising subgraph counts. These random variables are defined via colour systems.

Definition 4.1.

For integers $g\geq 0$ and $a_{1},\dots,a_{g},t_{1},\dots,t_{g}\geq 1$ , a colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ is a multigraph without loops which is coloured according to the following rules.

•

Each vertex has at most one colour and for each $1\leq i\leq g$ , there are exactly $a_{i}$ vertices of colour $i$ .

•

Each edge is incident to at least one coloured vertex.

•

Each edge has exactly one colour. If an edge is incident to exactly one coloured vertex, it receives the colour of that vertex. If an edge is incident to two coloured vertices, and these vertices have colours $i_{1}$ and $i_{2}$ , then the edge has colour $\min(i_{1},i_{2})$ .

•

Each edge of colour $i$ is additionally labelled with an integer in $\{1,\dots,t_{i}\}$ (we say that there are $t_{i}$ different shades) of colour $i$ . We do not assign shades to vertices, only edges.

•

Between any two vertices, there is at most one edge of each shade of each colour (but there can be multiple edges of different shades of the same colour).

The order of a colour system $\mathcal{G}$ is its total number of vertices (both coloured and uncoloured).

For a colour system $\mathcal{G}$ , let $\operatorname{U}(\mathcal{G})$ be the set of uncoloured vertices of $\mathcal{G}$ . In most of the statements throughout the paper, we will consider colour systems whose order $n$ is large with respect to the parameters $g,a_{1},\dots,a_{g},t_{1},\dots,t_{g}$ (in which case almost all vertices of $\mathcal{G}$ are uncoloured). In all of our statements involving asymptotic notation, the colour system parameters $g,a_{1},\dots,a_{g},t_{1},\dots,t_{g}$ will be treated as fixed constants for the asymptotic notation, while the order $n$ tends to infinity.

To make some sense of the definition of a colour system, recall from the outline in Section 2 that our proof is inductive, and at each step of the induction we consider multiple possibilities for the neighbourhoods of certain vertices. The $g$ colours in a colour system correspond to the vertices chosen at the $g$ different steps of the induction, and the $t_{i}$ different shades of colour $i$ correspond to $t_{i}$ different choices of neighbourhoods for the vertices of colour $i$ .

Now, we will mostly want to consider colour systems which have “typical” structure, meaning that the sizes of intersections between neighbourhoods of vertices are about what one should expect if the neighbourhoods were chosen randomly. For this, we introduce a notion of “general position” for families of sets.

Definition 4.2.

Consider subsets $S_{1},\dots,S_{m}$ of some ground set $R$ . For any subset $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ , we write $S_{I}=R\cap\bigcap_{i\in I}S_{i}\cap\bigcap_{i\notin I}(R\!\setminus\!S_{i})$ . For an integer $K\geq 1$ and some $0<p<1$ , we say that $S_{1},\dots,S_{m}\subseteq R$ are in $(p,K)$ -general position if for each $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ , we have

[TABLE]

Note that if $m=0$ , then $S_{\emptyset}=R$ and therefore in this case the empty collection of sets is in $(p,K)$ -general position for every integer $K\geq 1$ .

Definition 4.3.

Say that a colour system $\mathcal{G}$ is $p$ -general if the following holds. If we define $S_{1},\dots,S_{m}\subseteq\operatorname{U}(\mathcal{G})$ to be the $\operatorname{U}(\mathcal{G})$ -neighbourhoods of each of the coloured vertices of $\mathcal{G}$ in each of the shades of the respective colour (so we have $m=a_{1}\cdot t_{1}+\dots+a_{g}\cdot t_{g}$ if $\mathcal{G}$ has parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ ), then the sets $S_{1},\dots,S_{m}\subseteq\operatorname{U}(\mathcal{G})$ are in $(p,3^{g})$ -general position.

Furthermore, say that the colour system $\mathcal{G}$ is weakly $p$ -general if the sets $S_{1},\dots,S_{m}\subseteq\operatorname{U}(\mathcal{G})$ are in $(p,2\cdot 3^{g})$ -general position.

Note that every $p$ -general colour system is in particular weakly $p$ -general (the reason for having both these definitions is that when we make small changes to a collection of sets in $(p,K)$ -general position, the parameter $K$ changes slightly, and it is convenient to not have to explicitly keep track of this change). Also, note that for $g=0$ , every colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ is $p$ -general (since $m=0$ and the empty collection of sets is always in in $(p,K)$ -general position for all $K\geq 1$ ).

Recall from Section 2 that the whole point of introducing multiple vertices at each step of the induction is to allow for a richer range of possibilities for the effect of changing the status of an edge. In order to ensure the richest possible range of possibilities, we consider colour systems which are complete in the sense that we see essentially all the possible adjacencies between the coloured vertices, as follows.

Definition 4.4.

Call a colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ complete, if for any $1\leq i\leq g$ the following holds. Suppose for each $1\leq j\leq i-1$ and each vertex $v$ in $\mathcal{G}$ of colour $j$ we are given a subset $I_{v}\subseteq[t_{j}]$ . Then there exists a vertex $w$ of colour $i$ such that for every $1\leq j\leq i-1$ and every vertex $v$ of colour $j$ the vertices $w$ and $v$ are connected by edges of exactly those shades of colour $j$ that belong to the set $I_{v}$ .

Informally speaking, Definition 4.4 demands that for every colour $i$ we can find a vertex with prescribed edges to all the vertices of the previous (smaller) colours. For $g=0$ every colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ is complete, as the condition in Definition 4.4 is vacuous.

Note that whether a colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ is complete only depends on the edges between the coloured vertices, and it does not at all depend on the edges in colour $g$ . In contrast, whether $\mathcal{G}$ is $p$ -general for given $0<p<1$ only depends on the edges between the coloured and the uncoloured vertices.

Now, to obtain a graph from a colour system, we choose a shade of each colour, and we choose a graph on the uncoloured vertices, as follows.

Definition 4.5.

Let $\mathcal{G}$ be a colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ . Then, given a $g$ -tuple $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g}]$ , and a graph $G_{0}$ on the vertex set $\operatorname{U}(\mathcal{G})$ , define a graph $\mathcal{G}(G_{0},j_{1},\dots,j_{g})$ by taking all vertices of $\mathcal{G}$ together with all edges of $G_{0}$ and all edges of shade $j_{i}$ of colour $i$ for all $1\leq i\leq g$ . Furthermore, for a graph $H$ let $\psi_{H}(\mathcal{G},G_{0},j_{1},\dots,j_{g})$ be the number of labelled copies of $H$ in the graph $\mathcal{G}(G_{0},j_{1},\dots,j_{g})$ which use at least one vertex of each of the $g$ colours.

We will use notation such as $\psi_{H}(\mathcal{G},G_{0},\cdot)$ to denote the function $[t_{1}]\times\dots\times[t_{g}]\to\mathbb{Z}$ that maps $(j_{1},\dots,j_{g})$ to $\psi_{H}(\mathcal{G},G_{0},j_{1},\dots,j_{g})$ . Our goal for the rest of this paper will be to prove the following strengthening of Theorem 1.2.

Theorem 4.6.

Fix some $0<p<1$ , an $h$ -vertex graph $H$ , and integers $0\leq g\leq h-1$ and $a_{1},\dots,a_{g},\allowbreak t_{1},\dots,t_{g}\geq 1$ . Let $T=t_{1}\dotsm t_{g}$ . Then for any $p$ -general complete colour system $\mathcal{G}$ of order $n$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ and for any function $\lambda:[t_{1}]\times\dots\times[t_{g}]\to\mathbb{Z}$ the following holds. If $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ is a random graph on the vertex set $\operatorname{U}(\mathcal{G})$ , then

[TABLE]

The $o(1)$ -term in Theorem 4.6 goes to zero as $n$ tends to infinity, but it may depend on the choices for $p$ , $h$ , $H$ , and $g,a_{1},\dots,a_{g},t_{1},\dots,t_{g}$ fixed in the beginning of the theorem statement (in other words, these values are are treated as constants in the asymptotics).

Note that for $g=0$ , in Theorem 4.6 we have $T=1$ (using the convention that the empty product is equal to 1), and the colour system $\mathcal{G}$ has no coloured vertices (so it consists of $n$ uncoloured vertices and no edges). We already observed that such a colour system is always $p$ -general and complete. Furthermore, $\psi_{H}(\mathcal{G},G_{0})$ is simply the number of labelled copies of $H$ in $G_{0}\sim\mathbb{G}(n,p)$ . This quantity is precisely the random variable $X_{H}$ in Theorem 1.2. Thus, Theorem 4.6 for $g=0$ states that for all $\ell\in\mathbb{Z}$ we have $X_{H}=\ell$ with probability at most $n^{(1-h)\cdot 1+o(1)}=n^{1-h+o(1)}$ . This is precisely the statement of Theorem 1.2.

So, Theorem 1.2 corresponds to the case $g=0$ in Theorem 4.6, and it therefore suffices to prove Theorem 4.6. We will use backwards induction starting from $g=h-1$ and going down to $g=0$ . Note that the case $g=h-1$ is trival.

For the rest of the paper, we fix a particular graph $H$ with $h$ vertices and some $0<p<1$ . Before concluding this section, we make a few more definitions and state an important intermediate result for the induction step (Proposition 4.10, to follow). Basically, at each step of the induction, we have a colour system $\mathcal{G}$ with $g-1$ colours, and we add vertices of a new colour with random neighbourhoods, obtaining a random colour system $\mathcal{G_{S}}$ . We will use the “ $g$ ” case of Theorem 4.6 and a moment argument to show that typically $G_{0}$ has the property that $\psi(\mathcal{G_{S}},G_{0},\cdot)$ is anticoncentrated, subject only to the randomness in $\mathcal{G_{S}}$ . This will be the content of Proposition 4.10.

Definition 4.7.

For integers $g\geq 1$ and $a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}\geq 1$ , define a restricted colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ to be a colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1},1)$ in which there are no edges in colour $g$ between the coloured and the uncoloured vertices (so all edges of colour $g$ are between the vertices of colour $g$ , recalling that the colour of an edge is the minimum of the colours of its endpoints).

Call a restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ complete, if it is complete when viewed as a colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1},1)$ . Call a restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ essentially $p$ -general, if the colour system with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ obtained by ignoring colour $g$ is $p$ -general. Similarly, call $\mathcal{G}$ essentially weakly $p$ -general, if the colour system obtained by ignoring colour $g$ is weakly $p$ -general.

Definition 4.8.

Consider a restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . For each vertex $v$ of colour $g$ in $\mathcal{G}$ , choose a random subset $S_{v}\subseteq\operatorname{U}(\mathcal{G})$ by taking each element of $\operatorname{U}(\mathcal{G})$ independently with probability $p$ (and choose the different sets $S_{v}$ all independent from each other). Let $\mathcal{S}=(S_{v})_{v}$ be the collection of random sets chosen this way. Then we can obtain a random colour system $\mathcal{G_{S}}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1},1)$ by connecting each vertex $v$ of colour $g$ to all vertices in $S_{v}$ and colouring all these new edges in the unique shade of colour $g$ .

Definition 4.9.

Consider $0<q<1$ and an essentially $p$ -general complete restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . We say that a graph $G_{0}$ on the vertex set $\operatorname{U}(\mathcal{G})$ is $(p,q,\mathcal{G})$ -dispersed if for all functions $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{Z}$ the following holds. When choosing $\mathcal{S}$ randomly as in Definition 4.8, we have $\Pr\mathopen{}\mathclose{{}\left(\psi_{H}(\mathcal{G_{S}},G_{0},\cdot)=\lambda}\right)\leq q$ .

Now we are ready to state Proposition 4.10, as announced above.

Proposition 4.10.

Fix integers $1\leq g\leq h-1$ and $a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}\geq 1$ and let $T=t_{1}\dotsm t_{g-1}$ . Then there exists a function $\delta:\mathbb{N}\to\mathbb{R}_{\geq 0}$ with $\lim_{n\to\infty}\delta(n)=0$ , such that for any essentially $p$ -general complete restricted colour system $\mathcal{G}$ of order $n$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ the following holds. If $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ is a random graph on the vertex set $\operatorname{U}(\mathcal{G})$ , then with probability $1-n^{-\omega(1)}$ the graph $G_{0}$ is $(p,q,\mathcal{G})$ -dispersed, where $q=n^{(g-h+(1/2))\cdot T+\delta(n)}$ .

We remark that in the case $g=1$ , we have $T=1$ in Proposition 4.10 (again by the convention that the empty product is equal to $1$ ).

The rest of the paper is organised as follows. In Section 5 we give some very straightforward lemmas about sets in general position (in particular, random collections of sets are very likely to be in general position). In Section 6 we use a moment argument to prove that the “ $g$ ” case of Theorem 4.6 implies the corresponding case of Proposition 4.10. In Section 7 we show how this case of Proposition 4.10 implies the “ $g-1$ ” case of Theorem 4.6, completing the induction step. The contents of both of these sections consist mostly of calculations and putting various pieces together. However, in both these sections we omit the proofs of important anticoncentration lemmas (Lemmas 6.3 and 7.3). The rest of the paper is spent proving these lemmas via our new multivariate anticoncentration inequality in Theorem 3.1. In Section 8 we introduce the formalism of a “core”, which will be used for the proofs of Lemmas 6.3 and 7.3. To be specific, we prove a linear independence lemma for certain vectors defined in terms of cores, which we will use to check the linear independence condition in Theorem 3.1. Finally, in Section 9 we put everything together to prove Lemmas 6.3 and 7.3.

5 Sets in general position

In this section we record some simple lemmas regarding sets in general position. Recall that in the last section we fixed some $p\in(0,1)$ .

Lemma 5.1.

Suppose that $S_{1},\dots,S_{m}\subseteq R$ are subsets of some ground set $R$ which are in $(p,K)$ -general position, for some integer $K\geq 1$ . Then for every subset $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ , we have

[TABLE]

Proof.

Fix some subset $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ . For every subset $J\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ with $I\subseteq J$ , let $S_{J}=R\cap\bigcap_{i\in J}S_{i}\cap\bigcap_{i\notin J}(R\!\setminus\!S_{i})$ be as in Definition 4.2. Note that there are at most $2^{m}$ such subsets $J$ . Also, note that the set $\bigcap_{i\in I}S_{i}$ is the union of all the set $S_{J}$ for all $J\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ with $I\subseteq J$ , and all these sets $S_{J}$ are disjoint. Hence

[TABLE]

Here, and in the rest of this proof, the sum is over all subsets $J\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ with $I\subseteq J$ . Noting that

[TABLE]

we obtain that

[TABLE]

as desired. ∎

Lemma 5.2.

Fix some $m\in\mathbb{N}$ . Let $R$ be an $n$ -element set and let $S_{1},\dots,S_{m}\subseteq R$ be in $(p,K)$ -general position for some integer $K\geq 1$ . Let $S_{m+1}$ be a random set chosen by taking each element of $R$ independently with probability $p$ . Then with probability $1-n^{-\omega(1)}$ the sets $S_{1},\dots,S_{m},S_{m+1}\subseteq R$ are in $(p,K)$ -general position.

Proof.

For any subset $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ , let $S_{I}=R\cap\bigcap_{i\in I}S_{i}\cap\bigcap_{i\notin I}(R\!\setminus\!S_{i})$ be as in Definition 4.2. We need to show that with probability $1-n^{-\omega(1)}$ we have

[TABLE]

and

[TABLE]

for each $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ . By a union bound over all $2^{m}$ choices for $I$ , it suffices that for each individual set $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ each of these properties holds with probability $1-n^{-\omega(1)}$ .

Fix some $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ . By assumption, the set $S_{I}\subseteq R$ satisfies

[TABLE]

Note that $\mathopen{}\mathclose{{}\left|S_{I}\cap S_{m+1}}\right|$ is binomially distributed with parameters $(\mathopen{}\mathclose{{}\left|S_{I}}\right|,p)$ . Thus, the probability to have

[TABLE]

is by the Chernoff bound at least

[TABLE]

where we used that $|S_{I}|\leq n$ as $S_{I}\subseteq R$ . Whenever Equation 5.4 is satisfied, then together with Equation 5.3 we obtain by the triangle inequality

[TABLE]

as desired. Thus Equation 5.1 indeed holds with probability $1-n^{-\omega(1)}$ . Similarly, we can show that Equation 5.2 holds with probability $1-n^{-\omega(1)}$ by considering the probability that

[TABLE]

holds, using that $\mathopen{}\mathclose{{}\left|S_{I}\cap(R\!\setminus\!S_{m+1})}\right|$ is binomially distributed with parameters $(\mathopen{}\mathclose{{}\left|S_{I}}\right|,1-p)$ . ∎

Lemma 5.3.

Fix some positive integer $\ell$ . Then the following holds for sufficiently large $n$ . Let $R$ be an $n$ -element set and let $S_{1},\dots,S_{m}\subseteq R$ be in $(p,K)$ -general position for some integer $K\geq 1$ . Let $K^{\prime}$ be an integer with $K^{\prime}>K$ . Then for any subset $R^{\prime}\subseteq R$ obtained by deleting $\ell$ elements from $R$ , the sets $S_{1}\cap R^{\prime},\dots,S_{m}\cap R^{\prime}\subseteq R^{\prime}$ are in $(p,K^{\prime})$ -general position.

Proof.

We need to show that for every subset $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ we have

[TABLE]

So fix some $I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}$ . By assumption, the set $S_{I}\subseteq R$ satisfies

[TABLE]

Now, $S_{I}\cap R^{\prime}$ is obtained from $S_{I}$ by deleting at most $\ell$ elements, and therefore

[TABLE]

Furthermore, we have

[TABLE]

Thus, the triangle inequality yields (using $|R^{\prime}|=|R|-\ell$ )

[TABLE]

as long as $n=|R|$ is sufficiently large with respect to $\ell$ . ∎

Corollary 5.4.

Fix $g,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}$ , and suppose $n$ is sufficiently large with respect to these values. Let $\mathcal{G}$ be a $p$ -general colour system of order $n$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ . Then, whenever we delete $2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ vertices in $\operatorname{U}(\mathcal{G})$ , the resulting colour system is weakly $p$ -general.

Proof.

Let $S_{1},\dots,S_{m}\subseteq\operatorname{U}(\mathcal{G})$ be the $\operatorname{U}(\mathcal{G})$ -neighbourhoods of each of the coloured vertices of $\mathcal{G}$ in each shade of their respective colours. Then $S_{1},\dots,S_{m}\subseteq\operatorname{U}(\mathcal{G})$ are in $(p,3^{g-1})$ -general position. Thus, by Lemma 5.3, whenever we delete $2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ vertices in $\operatorname{U}(\mathcal{G})$ , the resulting configuration of sets is in $(p,2\cdot 3^{g-1})$ -general position. ∎

Lemma 5.5.

Fix $g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}$ . If $\mathcal{G}$ is a an essentially weakly $p$ -general restricted colour system of order $n$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ , then the colour system $\mathcal{G_{S}}$ in Definition 4.8 is $p$ -general with probability $1-n^{-\omega(1)}$ .

Proof.

Let $\mathcal{G}^{\prime}$ be the colour system of order $n$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ obtained from the restricted colour system $\mathcal{G}$ by ignoring colour $g$ (so $\operatorname{U}(\mathcal{G}^{\prime})$ consists of the set $\operatorname{U}(\mathcal{G})$ , together with the $a_{g}$ vertices of colour $g$ in $\mathcal{G}$ ). By the assumption on $\mathcal{G}$ , the colour system $\mathcal{G}^{\prime}$ is weakly $p$ -general. Let $S_{1},\dots,S_{m}\subseteq\operatorname{U}(\mathcal{G}^{\prime})$ be the $\operatorname{U}(\mathcal{G}^{\prime})$ -neighbourhoods of each of the coloured vertices of $\mathcal{G}^{\prime}$ in each shade of their respective colours (so we have $m=a_{1}\cdot t_{1}+\dots+a_{g-1}\cdot t_{g-1}$ ). Then the sets $S_{1},\dots,S_{m}\subseteq\operatorname{U}(\mathcal{G}^{\prime})$ are in $(p,2\cdot 3^{g-1})$ -general position. Note that $|\operatorname{U}(\mathcal{G})|=n-a_{1}+\dots+a_{g}=n-O(1)$ and $|\operatorname{U}(\mathcal{G}^{\prime})|-|\operatorname{U}(\mathcal{G})|=a_{g}=O(1)$ , and that $3^{g}>2\cdot 3^{g-1}$ , so by Lemma 5.3 (assuming $n$ is large), the sets $S_{1}\cap\operatorname{U}(\mathcal{G}),\dots,S_{m}\cap\operatorname{U}(\mathcal{G})\subseteq\operatorname{U}(\mathcal{G})$ are in $(p,3^{g})$ -general position. Applying Lemma 5.2 $a_{g}$ times, we see that with probability at least $1-a_{g}\cdot|\operatorname{U}(\mathcal{G})|^{-\omega(1)}=1-n^{-\omega(1)}$ these sets together with the random sets in $\mathcal{S}$ are in $(p,3^{g})$ -general position. This proves Lemma 5.5. ∎

6 Random neighbourhoods: Theorem 4.6 implies Proposition 4.10

In this section we will prove that if Theorem 4.6 holds for some $1\leq g\leq h-1$ , then Proposition 4.10 also holds for this value of $g$ .

Fix some $1\leq g\leq h-1$ and assume that Theorem 4.6 holds for this value of $g$ . Let $a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}$ be arbitrary positive integers. Our goal in this section is to prove Proposition 4.10 for these values of $g$ , $a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}$ .

For the entirety of this section, we fix these values of $g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}$ , and define $T=t_{1}\dotsm t_{g-1}$ . In all our asymptotics, these values will be treated as constants, while $n\to\infty$ .

In Section 6.1 we will start with some preparations. First, we use a martingale concentration inequality to prove that $\psi_{H}(\mathcal{G_{S}},G_{0},\cdot)$ is very likely to be close to its conditional expectation $\mu_{\mathcal{G},\mathcal{S}}$ given $\mathcal{S}$ (by symmetry, this conditional expectation actually only depends on the sizes of the intersections between the sets in $\mathcal{S}$ and the neighbourhoods of the various coloured vertices). Second, we state (but do not yet prove) an anticoncentration lemma for $\mu_{\mathcal{G},\mathcal{S}}$ , subject to the randomness in $\mathcal{S}$ .

In Section 6.2 we will use our preparatory lemmas and Theorem 4.6 to prove an anticoncentration bound for certain joint probabilities concerning the values of $\psi_{H}(\mathcal{G_{S}},G_{0},\cdot)$ for different choices of $\mathcal{S}$ . This will be the key input for a moment argument (as outlined in Section 2) with which we will deduce that $G_{0}$ is very likely to be dispersed, proving the desired case of Proposition 4.10. The details of this deduction will be presented in Section 6.3.

6.1 Preparations

Definition 6.1.

For a restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ and an outcome of the random collection of sets $\mathcal{S}=(S_{v})_{v}$ in Definition 4.8, let $\mu_{\mathcal{G},\mathcal{S}}:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ be the function given by

[TABLE]

for all $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ . Here, $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ is a random graph on the vertex set $\operatorname{U}(\mathcal{G})$ .

We remark that $\mu_{\mathcal{G},\mathcal{S}}$ only depends on $\mathcal{G}$ and $\mathcal{S}$ , and not $G_{0}$ .

Lemma 6.2.

For any essentially $p$ -general complete restricted colour system $\mathcal{G}$ which has parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ , the following holds. If we choose a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ and independently choose $\mathcal{S}$ randomly as in Definition 4.8, then

[TABLE]

Proof.

Condition on an arbitrary outcome of $\mathcal{S}$ . By a union bound, it suffices to prove that for each $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ the probability that

[TABLE]

is $n^{-\omega(1)}$ . So fix some $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ . The expectation of $\psi_{H}(\mathcal{G_{S}},G_{0},j_{1},\dots,j_{g})$ is precisely $\mu_{\mathcal{G},\mathcal{S}}(j_{1},\dots,j_{g})$ (recall that we are conditioning on an outcome of $\mathcal{S}$ ).

Note that changing the status of an edge of $G_{0}$ changes $\psi_{H}(\mathcal{G_{S}},G_{0},j_{1},\dots,j_{g})$ by at most $O(n^{h-g-2})$ . This is because are at most $h^{g+2}\cdot a_{1}\dotsm a_{g}\cdot n^{h-g-2}=O(n^{h-g-2})$ different labelled copies of $H$ in the $n$ -vertex graph $\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})$ which contain any particular pair of vertices of $\operatorname{U}(\mathcal{G})$ , and contain at least one vertex of each of the $g$ colours. Thus, by the Azuma–Hoeffding inequality (see for example [1, Theorem 7.2.1]) with the edge-exposure martingale, the probability that Equation 6.1 occurs is at most

[TABLE]

This finishes the proof of Lemma 6.2. ∎

Recall that we fixed $g,h,a_{1},\dots,a_{g}$ and $t_{1},\dots,t_{g-1}$ throughout this section, and that we defined $T=t_{1}\dotsm t_{g-1}$ .

Lemma 6.3.

For any essentially $p$ -general complete restricted colour system $\mathcal{G}$ of order $n$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ and any $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{Z}$ the following holds. If we choose $\mathcal{S}$ randomly as in Definition 4.8 then we have

[TABLE]

We defer the proof of Lemma 6.3 to Section 9.

6.2 Joint anticoncentration

The following statement is the key lemma for the moment argument we will use to finish the proof of Proposition 4.10.

Lemma 6.4.

Fix $t\in\mathbb{N}$ . Then for any essentially $p$ -general complete restricted colour system $\mathcal{G}$ of order $n$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ , and any function $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{Z}$ , the following holds. If we choose, all independently, a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ on the vertex set $\operatorname{U}(\mathcal{G})$ as well as $t$ random collections $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ chosen as in Definition 4.8, then

[TABLE]

Proof.

Fix an essentially $p$ -general complete restricted colour system $\mathcal{G}$ of order $n$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ , and a function $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{Z}$ .

For an outcome of the random collections $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ , we obtain a colour system $\mathcal{G}^{*}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1},t)$ as follows. Recall that by Definition 4.7, $\mathcal{G}$ has only one shade of colour $g$ and all edges of colour $g$ are between the vertices of colour $g$ . Replace each of these edges by $t$ parallel edges with all shades $\{1,\dots,t\}$ of colour $g$ . Also, for each $i=1,\dots,t$ and each vertex $v$ with colour $g$ , add edges, in shade $i$ of colour $g$ , between $v$ and all vertices in the set $S_{i,v}$ (where $S_{i,v}$ is the set corresponding to the vertex $v$ in the collection $\mathcal{S}_{i}$ ). We emphasise that $\mathcal{G}^{*}$ only depends on the random choices of $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ , and not on the random choice of $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ .

Note that $\operatorname{U}(\mathcal{G}^{*})=\operatorname{U}(\mathcal{G})$ and that for any $i=1,\dots,t$ , deleting all edges of colour $g$ except those of shade $i$ yields precisely the colour system $\mathcal{G}_{\mathcal{S}_{i}}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1},1)$ . Thus, for any outcome of $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ , we have

[TABLE]

for all $(j_{1},\dots,j_{g-1})\in[t_{1}]\times\dots\times[t_{g-1}]$ and all $i=1,\dots,t$ . Also note that $\mathcal{G}^{*}$ is always complete, because $\mathcal{G}$ is complete and being complete does not depend on the edges of colour $g$ .

Say that an outcome of $(G_{0},\mathcal{S}_{1},\dots,\mathcal{S}_{t})$ is common if both of the following conditions hold.

(i)

we have

[TABLE]

for all $i=1,\dots,t$ ;

(ii)

the colour system $\mathcal{G}^{*}$ given by $\mathcal{G}$ and $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ is $p$ -general.

Claim 6.5.

$(G_{0},\mathcal{S}_{1},\dots,\mathcal{S}_{t})$ * is common with probability $1-n^{-\omega(1)}$ .*

Proof.

By Lemma 6.2, for each $i=1,\dots,t$ the probability that (i) fails is at most $n^{-\omega(1)}$ . Thus, (i) holds with probability at least $1-t\cdot n^{-\omega(1)}=1-n^{-\omega(1)}$ .

For (ii), note that by Lemma 5.5 the colour system $\mathcal{G}_{\mathcal{S}_{1}}$ is $p$ -general with probability $1-n^{-\omega(1)}$ . We can then apply Lemma 5.2 $(t-1)\cdot a_{g}$ times for all the additional sets in $\mathcal{S}_{2},\dots,\mathcal{S}_{t}$ . ∎

Now, in order to prove Lemma 6.4, we need to bound the probability that

[TABLE]

We remark that if we condition on an outcome for $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ satisfying (ii), then one can apply Theorem 4.6 to get an upper bound on this probability (using only the randomness of $G_{0}$ ). However, this bound is weaker than the one claimed in Lemma 6.4. We will be able to obtain a stronger bound by using the randomness of both $G_{0}$ and of $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ . For the argument, both of the conditions (i) and (ii) will be relevant.

Whenever the outcome of $(G_{0},\mathcal{S}_{1},\dots,\mathcal{S}_{t})$ is common (specifically, when (i) is satisfied), in order to satisfy Equation 6.3 we must have

[TABLE]

Note that Equation 6.4 does not depend on $G_{0}$ , only on the random sets in $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ . By Lemma 6.3, for each $i=1,\dots,t$ the probability of having $\mathopen{}\mathclose{{}\left\|\mu_{\mathcal{G},\mathcal{S}_{i}}-\lambda}\right\|_{\infty}\leq n^{h-g-1}\cdot\log n$ is at most $n^{-T/2+o(1)}$ . So, by the independence of the $\mathcal{S}_{i}$ , the probability that Equation 6.4 holds is bounded as follows.

[TABLE]

Now, fix any outcomes of $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ satisfying (ii) and Equation 6.4. By condition (ii), the colour system $\mathcal{G}^{*}$ is $p$ -general, and furthermore recall that $\mathcal{G}^{*}$ is complete. Hence, applying Theorem 4.6 to the colour system $\mathcal{G}^{*}$ (which has parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1},t)$ ), conditioned on our outcomes of $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ , we have

[TABLE]

(Recall that in this section we are assuming that Theorem 4.6 holds for $g$ ). In other words, recalling Equation 6.2, if we condition on any outcomes of $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ such that (ii) and Equation 6.4 hold, then the random choice $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ satisfies Equation 6.3 with probability at most $n^{(g-h+1)\cdot T\cdot t+o(1)}$ .

Combining this with Equation 6.5, we see that the probability that $(G_{0},\mathcal{S}_{1},\dots,\mathcal{S}_{t})$ simultaneously satisfies Equation 6.4, (ii) and Equation 6.3 is at most

[TABLE]

Recall that any common outcome satisfying condition Equation 6.3 also needs to satisfy Equation 6.4, and recall from Claim 6.5 that $(G_{0},\mathcal{S}_{1},\dots,\mathcal{S}_{t})$ is common with probability at least $1-n^{-\omega(1)}$ . Thus, the total probability that Equation 6.3 holds is at most $n^{(g-h+(1/2))\cdot T\cdot t+o(1)}+n^{-\omega(1)}=n^{(g-h+(1/2))\cdot T\cdot t+o(1)}$ . This finishes the proof of Lemma 6.4. ∎

In the proof of Proposition 4.10, we will use the following corollary of Lemma 6.4. In contrast to the setting of Lemma 6.4, in 6.6 we will not require $t\in\mathbb{N}$ to be fixed, but just that $n$ is sufficiently large with respect to $t$ (and with respect to the values for $g,h,a_{1},\dots,a_{g}$ and $t_{1},\dots,t_{g-1}$ that we fixed for this entire section). Also note that the statement of 6.6 does not contain any asymptotic notation.

Corollary 6.6.

As before, let $g,h,a_{1},\dots,a_{g}$ and $t_{1},\dots,t_{g-1}$ be fixed. Then for every $t\in\mathbb{N}$ , there exists $N(t)\in\mathbb{N}$ such that the following holds for any essentially $p$ -general complete restricted colour system $G$ of order $n\geq N(t)$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ , and any function $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{Z}$ . If we choose, all independently, a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ on the vertex set $\operatorname{U}(\mathcal{G})$ as well as $t$ random collections $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ chosen as in Definition 4.8, then

[TABLE]

Proof.

For each $t\in\mathbb{N}$ , the $o(1)$ -term in the statement of Lemma 6.4 with this particular value of $t$ converges to zero (as $n$ goes to infinity). Hence there is some $N(t)\in\mathbb{N}$ such that this $o(1)$ -term is at most $1$ for all $n\geq N(t)$ . In other words, this means that whenever $n\geq N(t)$ , the probability appearing in the statements of Lemma 6.4 and 6.6 is indeed at most $n^{(g-h+(1/2))\cdot T\cdot t+1}$ , as desired. ∎

6.3 Completing the proof of Proposition 4.10

In this subsection, we will finally deduce Proposition 4.10 for our given value of $g$ . In order to do so, we will apply 6.6 with a carefully chosen value of $t$ . Recall that $t$ is not required to be fixed in 6.6, it is only required that $n$ is sufficiently large with respect to $t$ (and the values of $g,a_{1},\dots,a_{g}$ and $t_{1},\dots,t_{g-1}$ that we fixed throughout this section). Let $N:\mathbb{N}\to\mathbb{N}$ be a function mapping each $t\in\mathbb{N}$ to some $N(t)\in\mathbb{N}$ such that 6.6 holds.

In order to prove Proposition 4.10, we can assume that $n\geq N(1)$ . For any integer $n\geq N(1)$ , let us define $t(n)$ to be the maximum $t\in\mathbb{N}$ with $N(t)\leq n$ . Note that then we have $N(t(n))\leq n$ for all $n\geq N(1)$ , and that $t(n)=\omega(1)$ .

Define the function $\delta$ by $\delta(n)=t(n)^{-1/2}$ for all $n\geq N(1)$ , and observe that $\lim_{n\to\infty}\delta(n)=0$ . Let $\mathcal{G}$ be any essentially $p$ -general complete restricted colour system $\mathcal{G}$ of order $n\geq N(1)$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . We need to show that a randomly chosen graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ on the vertex set $\operatorname{U}(\mathcal{G})$ is $(p,q,\mathcal{G})$ -dispersed with probability $1-n^{-\omega(1)}$ , where

[TABLE]

Recall from Definition 4.9 that a graph $G_{0}$ on the vertex set $\operatorname{U}(\mathcal{G})$ being $(p,q,\mathcal{G})$ -dispersed means that for all functions $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{Z}$ the following condition holds: when choosing $\mathcal{S}$ randomly as in Definition 4.8, $\Pr\mathopen{}\mathclose{{}\left(\psi_{H}(\mathcal{G_{S}},G_{0},\cdot)=\lambda}\right)\leq q$ . Let $A_{\lambda}$ be the event that $G_{0}$ fails to satisfy this condition (i.e. that the probability is strictly larger than $q$ ) for some particular $\lambda$ . So, we need to show that with probability $1-n^{-\omega(1)}$ , no $A_{\lambda}$ occurs. Note that we always have $0\leq\psi_{H}(\mathcal{G_{S}},G_{0},j_{1},\dots,j_{g})\leq n^{h}$ , so we only need to consider $(n^{h}+1)^{T}$ possibilities for $\lambda$ . It therefore suffices to show that $\Pr(A_{\lambda})=n^{-\omega(1)}$ for each $\lambda$ : we then can take a union bound over all possibilities for $\lambda$ . So, fix a function $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\{0,\dots,n^{h}\}$ .

Let $t=t(n)$ . All independently, choose a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ , and choose $t$ collections $\mathcal{S}_{1},\dots,\mathcal{S}_{t}$ as in Definition 4.8. By 6.6 (using that $N(t)=N(t(n))\leq n$ ), we have

[TABLE]

We remark that the probability on the left-hand side of Equation 6.6 can be interpreted as the $t$ -th moment of the random variable (depending on $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ ) measuring the conditional probability that $\psi_{H}(\mathcal{G}_{\mathcal{S}},G_{0},\cdot)=\lambda$ , for a random choice of $\mathcal{S}$ as in Definition 4.8. The rest of the proof, to follow, can basically be interpreted as applying Markov’s inequality with this $t$ -th moment.

By the definition of $A_{\lambda}$ , if we condition on an outcome of $G_{0}$ for which $A_{\lambda}$ holds, then for each $i$ , with probability at least $q$ we have $\psi_{H}(\mathcal{G}_{\mathcal{S}_{i}},G_{0},\cdot)=\lambda$ . Since the $\mathcal{S}_{i}$ are independent, we deduce that

[TABLE]

Together with Equation 6.6, this implies

[TABLE]

and therefore, recalling $q=n^{(g-h+(1/2))\cdot T+\delta(n)}$ ,

[TABLE]

(Recall that $\delta(n)=t(n)^{-1/2}=t^{-1/2}$ and that $t=t(n)=\omega(1)$ ). This concludes the proof.

7 Completing the induction step: Proposition 4.10 for $g$ implies Theorem 4.6 for $g-1$

For the duration of this section, we fix some $1\leq g\leq h-1$ and assume that Proposition 4.10 holds for this value of $g$ . Our goal for this section is to prove Theorem 4.6 for $g-1$ . Our approach is as outlined in Section 2: we decompose $\psi_{H}(\mathcal{G},G_{0},\cdot)$ into two parts, use a “coarse-scale” anticoncentration lemma to handle the larger part, and use our assumed case of Proposition 4.10 to handle the smaller part.

Let $a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}$ be arbitrary positive integers. Our goal is to prove Theorem 4.6 for $g-1$ and these values of $a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}$ .

For this entire section, let us fix the values of $a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}$ , and define $T=t_{1}\dotsm t_{g-1}$ and $a_{g}=2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ . In all our asymptotics, these values will be treated as constants, while $n\to\infty$ . In Section 7.1 we will prove a concentration lemma which we will use to control the fluctuation of the smaller of our two parts. We also state (but do not yet prove) an anticoncentration lemma which we will use for the larger of our two parts. In Section 7.2 we combine these lemmas with our assumed case of Proposition 4.10 to prove our desired case of Theorem 4.6.

7.1 Preparations

Definition 7.1.

Consider an essentially $p$ -general complete restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . Let $\mu_{\mathcal{G}}:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ be the function given by

[TABLE]

for all $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ . Here, $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ is a random graph on the vertex set $\operatorname{U}(\mathcal{G})$ and $\mathcal{S}=(S_{v})_{v}$ is a randomly chosen collection of sets as in Definition 4.8 (and $G_{0}$ and $\mathcal{S}$ are chosen independently).

We remark that $\mu_{\mathcal{G}}$ only depends on $\mathcal{G}$ .

Lemma 7.2.

The following holds for any essentially $p$ -general complete restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . If we choose a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ and independently choose $\mathcal{S}$ randomly as in Definition 4.8, then

[TABLE]

Proof.

By a union bound, it suffices to prove that for each $g$ -tuple $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ the probability to have

[TABLE]

is $n^{-\omega(1)}$ . So fix some $g$ -tuple $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ . Note that the expectation of $\psi_{H}(\mathcal{G_{S}},G_{0},j_{1},\dots,j_{g})$ is precisely $\mu_{\mathcal{G}}(j_{1},\dots,j_{g})$ .

Note that the random collection $\mathcal{S}=(S_{v})_{v}$ as in Definition 4.8 consists of $a_{g}$ different sets $S_{v}\subseteq\operatorname{U}(\mathcal{G})$ , one for each vertex of colour $g$ . For any vertex $u\in\operatorname{U}(\mathcal{G})$ , changing which of the sets $S_{v}$ the vertex $u$ belongs to, changes the edges in the graph $\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})$ between $u$ and the vertices of colour $g$ . Hence it changes the value of $\psi_{H}(\mathcal{G_{S}},G_{0},j_{1},\dots,j_{g})$ by at most $h^{g+1}\cdot a_{1}\dotsm a_{g}\cdot n^{h-g-1}=O(n^{h-g-1})$ (since there are at most that many labelled copies of $H$ in the $n$ -vertex graph $\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})$ which contain $u$ as well as at least one vertex of each of the $g$ colours).

Consider the Doob martingale with respect to $\psi_{H}(\mathcal{G_{S}},G_{0},j_{1},\dots,j_{g})$ obtained by first exposing the graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ one edge at a time, and then exposing the random collection $\mathcal{S}$ in the following way. In each step, for one vertex $u\in\operatorname{U}(\mathcal{G})$ , we expose which of the sets $S_{v}$ of the collection $\mathcal{S}$ contain the vertex $u$ . As in the proof of Lemma 6.2, changing the status of an edge of $G_{0}$ changes $\psi_{H}(\mathcal{G_{S}},G_{0},j_{1},\dots,j_{g})$ by at most $h^{g+2}\cdot a_{1}\dotsm a_{g}\cdot n^{h-g-2}=O(n^{h-g-2})$ . So, by the Azuma–Hoeffding inequality the probability that Equation 7.1 occurs is at most

[TABLE]

This finishes the proof of Lemma 7.2. ∎

Lemma 7.3.

The following holds for any weakly $p$ -general complete colour system $\mathcal{G}$ of order $n$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ and any $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\to\mathbb{Z}$ . If we choose a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ on the vertex set $\operatorname{U}(\mathcal{G})$ then

[TABLE]

We defer the proof of Lemma 7.3 to Section 9.

7.2 Proof of Theorem 4.6 for $g-1$

In this subsection, we deduce Theorem 4.6 for $g-1$ from Proposition 4.10 for $g$ . Consider any $p$ -general complete colour system $\mathcal{G}$ of order $n$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ and any function $\lambda:[t_{1}]\times\dots\times[t_{g-1}]\to\mathbb{Z}$ . We may assume that $n$ is sufficiently large with respect to the parameters $g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}$ that were fixed throughout this section.

Since $\mathcal{G}$ is a $p$ -general colour system, the $\operatorname{U}(\mathcal{G})$ -neighbourhoods of each of the coloured vertices of $\mathcal{G}$ in each of the shades of the respective colour are in $p$ -general position. This implies that there exist vertices in $\operatorname{U}(\mathcal{G})$ representing all possible ways to be adjacent to the coloured vertices. To be specific, for each $1\leq j\leq g-1$ and each vertex $v$ in $\mathcal{G}$ of colour $j$ , consider any subset $I_{v}\subseteq[t_{j}]$ . There is a vertex $w\in\operatorname{U}(\mathcal{G})$ such that for every $1\leq j\leq g-1$ and every vertex $v$ of colour $j$ , between $w$ and $v$ there are edges of exactly those shades of colour $j$ that belong to the set $I_{v}$ . For each choice of the subsets $I_{v}$ , fix a particular such vertex $w$ , and let $W$ be the set of all these fixed vertices $w$ . Then, $W$ has size

[TABLE]

To prove Theorem 4.6, we need to show that for a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ , we have

[TABLE]

Fix any possible outcome of the edges of the graph $G_{0}[W]$ between the vertices in $W$ . We will prove that the event $\psi_{H}(\mathcal{G},G_{0},\cdot)=\lambda$ occurs with probability at most $n^{(g-h)\cdot T+o(1)}$ conditioned on this outcome.

We can obtain a colour system $\mathcal{G}^{-}$ of order $n-a_{g}$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ from the colour system $\mathcal{G}$ by deleting the $a_{g}$ vertices in $W\subseteq\operatorname{U}(\mathcal{G})$ . By 5.4, since $\mathcal{G}$ is $p$ -general, $\mathcal{G}^{-}$ is weakly $p$ -general. Also, $\mathcal{G}^{-}$ is complete, because $\mathcal{G}$ is complete and being complete does not depend on any of the uncoloured vertices.

Furthermore, from $\mathcal{G}$ and our fixed outcome of $G_{0}[W]$ , we can obtain a restricted colour system $\mathcal{G}^{\prime}$ of order $n$ with parameters $(g,a_{1},\dots,a_{g-1},a_{g},t_{1},\dots,t_{g-1})$ in the following way. First, colour all the $a_{g}$ vertices in $W\subseteq\operatorname{U}(\mathcal{G})$ with colour $g$ . Then, take a single shade of colour $g$ and colour all edges in $G_{0}[W]$ with this single shade of colour $g$ . The restricted colour system $\mathcal{G}^{\prime}$ so obtained is complete and essentially $p$ -general, by our choice of $W$ and the assumptions that $\mathcal{G}$ is complete and $p$ -general.

Given our fixed outcome of $G_{0}[W]$ , we can choose the rest of the random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ in two steps as follows. First, we choose a random graph $G_{0}^{-}\sim\mathbb{G}(\operatorname{U}(\mathcal{G})\!\setminus\!W,p)$ on the vertex set $\operatorname{U}(\mathcal{G})\!\setminus\!W$ . Then, choose random subsets $S_{v}\subseteq\operatorname{U}(\mathcal{G})\!\setminus\!W$ for each vertex $v\in W$ by taking each element of $\operatorname{U}(\mathcal{G})\!\setminus\!W$ independently with probability $p$ (and choose all these sets $S_{v}$ independently of each other and independent of $G_{0}^{-}$ ). Now, take $G_{0}$ to be the graph on the vertex set $\operatorname{U}(\mathcal{G})$ obtained by starting with the union of the edges of the graphs $G_{0}^{-}$ and $G_{0}[W]$ , and adding edges between each vertex $v\in W$ and all the vertices in $S_{v}\subseteq\operatorname{U}(\mathcal{G})\!\setminus\!W$ .

Let $\mathcal{S}=(S_{v})_{v\in W}$ be the collection of random sets chosen above. Note that the random choice of $\mathcal{S}$ is precisely what is described in Definition 4.8 for the restricted colour system $\mathcal{G^{\prime}}$ (recall that $W$ is the set of vertices of colour $g$ in the restricted colour system $\mathcal{G^{\prime}}$ ).

Then, for any outcome of $\mathcal{S}$ and $G_{0}^{-}\sim\mathbb{G}(\operatorname{U}(\mathcal{G})\!\setminus\!W,p)$ , and any $(j_{1},\dots,j_{g-1})\in[t_{1}]\times\dots\times[t_{g-1}]$ , the graph $\mathcal{G}(G_{0},j_{1},\dots,j_{g-1})$ is the same as the graph $\mathcal{G^{\prime}_{S}}(G_{0}^{-},j_{1},\dots,j_{g-1},1)$ . (Here $\mathcal{G^{\prime}_{S}}$ is the colour system obtained from the restricted colour system $\mathcal{G^{\prime}}$ as in Definition 4.8.) Recall that by definition $\psi_{H}(\mathcal{G},G_{0},j_{1},\dots,j_{g-1})$ is the number of labelled copies of $H$ in the graph $\mathcal{G}(G_{0},j_{1},\dots,j_{g-1})$ which use at least one vertex of each of the colours $1,\dots,g-1$ . If such a copy of $H$ contains at least one vertex in $W$ , then it is one of the $\psi_{H}(\mathcal{G^{\prime}_{S}},G_{0}^{-},j_{1},\dots,j_{g-1},1)$ labelled copies of $H$ in the graph $\mathcal{G^{\prime}_{S}}(G_{0}^{-},j_{1},\dots,j_{g-1},1)=\mathcal{G}(G_{0},j_{1},\dots,j_{g-1})$ which use at least one vertex of each of the $g$ colours of $\mathcal{G}^{\prime}$ . On the other hand, if it does not contain a vertex of $W$ , then it is one of the $\psi_{H}(\mathcal{G}^{-},G_{0}^{-},j_{1},\dots,j_{g-1})$ labelled copies of $H$ in the graph $\mathcal{G^{-}}(G_{0}^{-},j_{1},\dots,j_{g-1})=\mathcal{G}(G_{0},j_{1},\dots,j_{g-1})\!\setminus\!W$ which use at least one vertex of each of the colours $1,\dots,g-1$ . Thus, for any outcome of $\mathcal{S}$ and $G_{0}^{-}$ , we have

[TABLE]

For random $\mathcal{S}=(S_{v})_{v\in W}$ and $G_{0}^{-}\sim\mathbb{G}(\operatorname{U}(\mathcal{G})\!\setminus\!W,p)$ as above, it therefore suffices to prove that

[TABLE]

Let $\delta:\mathbb{N}\to\mathbb{R}_{\geq 0}$ be the function obtained by applying Proposition 4.10 with parameters $g,a_{1},\dots,a_{g}$ , $t_{1},\dots,t_{g-1}$ (recall that we are assuming that Proposition 4.10 holds for $g$ ). So $\delta(n)=o(1)$ . Call an outcome of $(\mathcal{S},G_{0}^{-})$ typical if the following two conditions hold.

(a)

For $q=n^{(g-h+(1/2))\cdot T+\delta(n)}$ , the graph $G_{0}^{-}$ is $(p,q,\mathcal{G^{\prime}})$ -dispersed;

(b)

we have $\mathopen{}\mathclose{{}\left\|\psi_{H}(\mathcal{G^{\prime}_{S}},G_{0}^{-},\cdot)-\mu_{\mathcal{G^{\prime}}}}\right\|_{\infty}\leq n^{h-g-(1/2)}\cdot\log n$ .

Claim 7.4.

$(\mathcal{S},G_{0}^{-})$ * is typical with probability $1-n^{-\omega(1)}$ .*

Proof.

Recall that $\mathcal{G}^{\prime}$ is an essentially $p$ -general complete restricted colour system of order $n$ with parameters $(g,a_{1},\dots,a_{g-1},a_{g},t_{1},\dots,t_{g-1})$ . Its set of uncoloured vertices is $\operatorname{U}(\mathcal{G})\!\setminus\!W$ . Thus, by Proposition 4.10 (which we assumed to hold for $g$ ), the random graph $G_{0}^{-}\sim\mathbb{G}(\operatorname{U}(\mathcal{G})\!\setminus\!W,p)$ is $(p,q,\mathcal{G^{\prime}})$ -dispersed with probability $1-n^{-\omega(1)}$ . This shows that (a) holds with probability $1-n^{-\omega(1)}$ .

On the other hand, (b) holds with probability $1-n^{-\omega(1)}$ by Lemma 7.2 applied to the restricted colour system $\mathcal{G^{\prime}}$ . ∎

Note that whenever an outcome of $(\mathcal{S},G_{0}^{-})$ is typical (specifically, whenever (b) holds), we cannot have $\psi_{H}(\mathcal{G^{\prime}_{S}},G_{0}^{-},\cdot,1)+\psi_{H}(\mathcal{G}^{-},G_{0}^{-},\cdot)=\lambda$ unless

[TABLE]

(here we used that $n$ is sufficiently large with respect to $a_{g}=2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ ).

Consider the function $\lambda^{\prime}:[t_{1}]\times\dots\times[t_{g-1}]\to\mathbb{Z}$ defined by $\lambda^{\prime}(j_{1},\dots,j_{g-1})=\lambda(j_{1},\dots,j_{g-1})-\mu_{\mathcal{G^{\prime}}}(j_{1},\dots,j_{g-1},1)$ . By Lemma 7.3 applied with the function $\lambda^{\prime}$ and the weakly $p$ -general complete colour system $\mathcal{G}^{-}$ of order $n-a_{g}$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ , we see that Equation 7.3 holds with probability at most $(n-a_{g})^{-T/2+o(1)}=n^{-T/2+o(1)}$ .

Note that both Equation 7.3 and (a) only depend on the random choice of $G_{0}^{-}$ and not on the random choice of $\mathcal{S}$ . For any outcome of $G_{0}^{-}$ such that Equation 7.3 and (a) hold, the conditional probability of the event that $\psi_{H}(\mathcal{G^{\prime}_{S}},G_{0}^{-},\cdot,1)=\lambda-\psi_{H}(\mathcal{G}^{-},G_{0}^{-},\cdot)$ is at most $q=n^{(g-h+(1/2))\cdot T+\delta(n)}$ . (This follows directly from the definition of being $(p,q,\mathcal{G^{\prime}})$ -dispersed; see Definition 4.9).

Thus, the total probability that $(G_{0}^{-},\mathcal{S})$ is typical and satisfies $\psi_{H}(\mathcal{G^{\prime}_{S}},G_{0}^{-},\cdot,1)+\psi_{H}(\mathcal{G}^{-},G_{0}^{-},\cdot)=\lambda$ is at most

[TABLE]

recalling that $\delta(n)=o(1)$ . By Claim 7.4, the probability that $(G_{0}^{-},\mathcal{S})$ is not typical is $n^{-\omega(1)}$ , so the probability in Equation 7.2 is at most $n^{(g-h)\cdot T+o(1)}+n^{-\omega(1)}=n^{(g-h)\cdot T+o(1)}$ . This finishes the proof of Theorem 4.6 for $g-1$ .

8 Cores and non-degeneracy

It remains to prove Lemmas 6.3 and 7.3. Both of these lemmas will be proved using our new multivariate anticoncentration inequality (Theorem 3.1), which requires us to check a non-degeneracy condition for a certain collection of vectors. In this section we introduce the formalism of cores, which are special types of colour systems of bounded size. For a core $\mathcal{C}$ , we will define certain functions $\Gamma_{\mathcal{C},e}$ (which may be interpreted as belonging to a vector space of functions), and we show that under certain conditions these functions span the entire space.

The point of this section is that in the settings of both Lemma 6.3 and Lemma 7.3, the collections of vectors that we need to study to apply Theorem 3.1 are in correspondence with collections of functions $\Gamma_{\mathcal{C},e}$ , for appropriately chosen cores $\mathcal{C}$ . In the next section (Section 9) we will specify how to actually choose the cores for these correspondences. Throughout this section, fix any $0\leq g\leq h-1$ (recall that $h$ is the number of vertices of our fixed graph $H$ ).

Definition 8.1.

For integers $0\leq g\leq h-1$ and $a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}\geq 1$ , a core $\mathcal{C}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ is a colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1},1)$ which has exactly one uncoloured vertex, such that the uncoloured vertex has edges to all coloured vertices in all possible shades (meaning that for each $1\leq i\leq g$ , the uncoloured vertex has edges in all $t_{i}$ shades of colour $i$ to all vertices of colour $i$ ).

Note that for fixed parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ there are only finitely many different cores $\mathcal{C}$ with these parameters.

A core $\mathcal{C}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ is called complete if it is complete when viewed as a colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1},1)$ .

Definition 8.2.

A partial copy of $H$ in a core $\mathcal{C}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ is a labelled copy of a $(g+1)$ -vertex induced subgraph of $H$ which uses one vertex of each colour in $\mathcal{C}$ and the uncoloured vertex. More formally, a partial copy of $H$ in a core $\mathcal{C}$ is given by a graph homomorphism $\phi:H[V^{\prime}]\to\mathcal{C}$ for some subset $V^{\prime}\subseteq V(H)$ of size $g+1$ such that the image of $\phi$ contains one vertex of each colour and the uncoloured vertex.

Note that the homomorphism $\phi:H[V^{\prime}]\to\mathcal{C}$ in Definition 8.2 is automatically injective on vertices (and therefore also on edges).

Definition 8.3.

Consider a core $\mathcal{C}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . For a subset $V^{\prime}\subseteq V(H)$ of size $g+1$ , define the weight of a partial copy $\phi:H[V^{\prime}]\to\mathcal{C}$ of $H$ to be $p^{e(H)-e_{H}(V^{\prime})}$ . Also, for a $g$ -tuple $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ , say that a partial copy $\phi:H[V^{\prime}]\to\mathcal{C}$ of $H$ in $\mathcal{C}$ is $(j_{1},\dots,j_{g})$ -coloured if for each $1\leq i\leq g$ all the edges of colour $i$ in the image of $\phi$ have shade $j_{i}$ .

Note that if a partial copy $\phi:H[V^{\prime}]\to\mathcal{C}$ of $H$ in $\mathcal{C}$ does not contain any edges of colour $i$ in its image, then it can be $(j_{1},\dots,j_{g})$ -coloured for different values of $j_{i}$ . However, if $\phi$ has at least one edge of colour $i$ in its image, then it can only be $(j_{1},\dots,j_{g})$ -coloured if $j_{i}$ is the shade of the edges of colour $i$ in the image of $\phi$ (if there are different edges of colour $i$ with different shades in the image of $\phi$ , then $\phi$ is not $(j_{1},\dots,j_{g})$ -coloured for any $(j_{1},\dots,j_{g})$ ).

Definition 8.4.

For a core $\mathcal{C}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ and a collection of edges $F\subseteq E(\mathcal{C})$ , let $\Gamma_{\mathcal{C},F}:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ be the function defined as follows. For all $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ , let $\Gamma_{\mathcal{C},F}(j_{1},\dots,j_{g})$ be the sum of the weights of all $(j_{1},\dots,j_{g})$ -coloured partial copies of $H$ in $\mathcal{C}$ whose image contains all edges in $F$ . If $F$ just consists of one edge $e$ , we write $\Gamma_{\mathcal{C},e}$ instead of $\Gamma_{\mathcal{C},\{e\}}$ .

Note that we have $\Gamma_{\mathcal{C},F}(j_{1},\dots,j_{g})=0$ if for some $1\leq i\leq g$ the set $F$ contains an edge of colour $i$ with a shade distinct from $j_{i}$ , because then there are no $(j_{1},\dots,j_{g})$ -coloured partial copies of $H$ in $\mathcal{C}$ whose image contains this edge.

Now, the main result of this section is as follows, showing that the functions $\Gamma_{\mathcal{C},e}$ satisfy a non-degeneracy condition.

Lemma 8.5.

Let $\mathcal{C}$ be a complete core with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . Consider the functions $\Gamma_{\mathcal{C},e}$ , where $e$ ranges over all edges between the uncoloured vertex of $\mathcal{C}$ and a vertex of colour $g$ . Then these functions span the real vector space of all functions $[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ .

8.1 Proof of Lemma 8.5

For this subsection, fix a core $\mathcal{C}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . Let $L$ be the span of the functions $\Gamma_{\mathcal{C},e}$ , for all edges $e$ between the uncoloured vertex of $\mathcal{C}$ and a vertex of colour $g$ . Note that $L$ is a subspace of the real vector space of all functions $[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ . Our goal is to show that $L$ is actually the entire space of functions $[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ .

First, let us make some more definitions.

Definition 8.6.

For an integer $1\leq b\leq g$ , a downward tree of size $b$ in $\mathcal{C}$ is a collection $F\subseteq E(\mathcal{C})$ of $b$ edges of colours $g-b+1,\dots,g$ which form a tree containing the uncoloured vertex of $\mathcal{C}$ as a leaf.

Although formally a downward tree $F$ is just a collection of edges, we say that $F$ contains a vertex of $\mathcal{C}$ if this vertex is part of the tree formed by the edges in $F$ . For an example of a downward tree, see Figure 1.

Lemma 8.7.

For any integer $1\leq b\leq g$ , every downward tree $F$ of size $b$ in $\mathcal{C}$ contains the uncoloured vertex and exactly one vertex in each of the colours $g-b+1,\dots,g$ (and no vertices in the colours $1,\dots,g-b$ ). Furthermore, the vertex of colour $g-b+1$ is a leaf in the tree $F$ . Finally, if $b\geq 2$ and $e$ is the unique edge of $F$ incident to the vertex of colour $g-b+1$ , then $e$ is not incident to the uncoloured vertex and $F\setminus\{e\}$ is a downward tree of size $b-1$ .

Proof.

Since $F$ has an edge in each of the colours $g-b+1,\dots,g$ , it must also have a vertex in each of these colours (recall that in a colour system an edge can only have colour $i$ if it is incident to a vertex of colour $i$ ). Thus, $F$ contains at least one vertex in each of the colours $g-b+1,\dots,g$ and by definition it also contains the uncoloured vertex. As $F$ has only $b+1$ vertices, this establishes the first part of the lemma.

For the second part we need to show that there is only one edge of $F$ incident to the vertex of colour $g-b+1$ . However, note that each edge of $F$ incident to the vertex of colour $g-b+1$ has colour $g-b+1$ (because the other vertex of each such edge is either uncoloured or has a colour with a number larger than $g-b+1$ ). As $F$ has only one edge of colour $g-b+1$ , there is indeed only one edge of $F$ incident to the vertex of colour $g-b+1$ .

Finally, for the third part, note that the edge $e$ has colour $g-b+1$ (by the argument above), so $F\setminus\{e\}$ is a tree with $b-1$ edges of colours $g-b+2,\dots,g$ . The edge $e$ is not incident to the uncoloured vertex, as otherwise both vertices of $e$ would be leaves in $F$ which would contradict $b\geq 2$ . In particular, the uncoloured vertex is still a leaf of the tree $F\setminus\{e\}$ . Hence $F\setminus\{e\}$ is indeed a downward tree of size $b-1$ . ∎

Lemma 8.8.

For every downward tree $F\subseteq E(\mathcal{C})$ , we have $\Gamma_{\mathcal{C},F}\in L$ .

Proof.

We prove the lemma by induction on $b$ .

If $b=1$ , then $F=\{e\}$ for a single edge $e$ . By definition, the uncoloured vertex is a leaf of $F$ , so $e$ is incident to the uncoloured vertex. Furthermore, $e$ has colour $g$ , so it is also incident to a vertex of colour $g$ . We therefore trivially have $\Gamma_{\mathcal{C},e}\in L$ , by the definition of $L$ .

Now, let us assume that $b\geq 2$ and that the lemma statement holds for $b-1$ . Let $F$ be a downward tree of size $b$ . Recall that by Lemma 8.7, $F$ contains precisely one vertex $v^{*}$ of colour $g-b+1$ and this vertex is a leaf in $F$ . So let $e^{*}$ be the unique edge in $F$ incident to $v^{*}$ . Then by the last part of Lemma 8.7, $F\setminus\{e^{*}\}$ is a downward tree of size $b-1$ . Finally, let $w$ be the other vertex of $e^{*}$ , so that (again by Lemma 8.7) $w$ is coloured with one of the colours $g-b+2,\dots,g$ .

Applying Lemma 8.7 to the downward tree $F\setminus\{e^{*}\}$ , we can label the vertices of $F\setminus\{e^{*}\}$ as $v_{g-b+2},\dots,v_{g},u$ , such that $u$ is the uncoloured vertex of $\mathcal{C}$ and such that each vertex $v_{i}$ has colour $i$ . Note that $w$ is one of the vertices $v_{g-b+2},\dots,v_{g}$ .

Now, since $\mathcal{C}$ is complete (recall Definition 4.4), we can recursively define vertices $v^{\prime}_{g-b+2},\dots,v^{\prime}_{g}$ in $\mathcal{C}$ , with colours $g-b+2,\dots,g$ respectively, such that the following three conditions are satisfied.

•

For all $g-b+2\leq j<i\leq g$ , the vertices $v^{\prime}_{i}$ and $v^{\prime}_{j}$ are connected by edges of exactly the same shades of colour $j$ as the vertices $v_{i}$ and $v_{j}$ .

•

For all $g-b+2\leq i\leq g$ and all vertices $v$ of $\mathcal{C}$ of any of the colours $1,\dots,g-b+1$ such that $(v_{i},v)\neq(w,v^{*})$ , the vertices $v^{\prime}_{i}$ and $v$ are connected by edges of exactly the same shades of the colour of $v$ as the vertices $v_{i}$ and $v$ .

•

For the index $g-b+2\leq i\leq g$ such that $v_{i}=w$ , the vertices $v^{\prime}_{i}$ and $v^{*}$ are connected by edges of exactly the same shades of the of colour $g-b+1$ as the vertices $v_{i}$ and $v^{*}$ except that there is no edge between $v^{\prime}_{i}$ and $v^{*}$ with the shade of $e^{*}$ .

Informally speaking, these conditions are saying that the shade of the edges between any two of the vertices $v^{\prime}_{g-b+2},\dots,v^{\prime}_{g}$ and the shades of the edges between the vertices $v^{\prime}_{g-b+2},\dots,v^{\prime}_{g}$ and the vertices of colours $1,\dots,g-b+1$ are the same as the corresponding shades for $v_{g-b+2},\dots,v_{g}$ except that between the vertices $w^{\prime}$ and $v^{*}$ the shade of the edge $e^{*}$ between $w$ and $v^{*}$ is missing. Here, $w^{\prime}$ denotes the vertex $v^{\prime}_{i}$ for the index $g-b+2\leq i\leq g$ such that $v_{i}=w$ .

Now, for each edge $e\in F\setminus\{e^{*}\}$ with endpoints $v_{i}$ and $v_{j}$ , for $g-b+2\leq j<i\leq g$ , there exists an edge $e^{\prime}\in E(\mathcal{C})$ between $v^{\prime}_{i}$ and $v^{\prime}_{j}$ such that $e^{\prime}$ has the same shade of colour $j$ as $e$ . Let $F^{\prime}$ be the collection of all these edges together with the unique edge between $v^{\prime}_{g}$ and $u$ (recall that there is only one shade of colour $g$ and that the edge between $v_{g}$ and $u$ is the only edge in $F\setminus\{e^{*}\}$ incident to $u$ ). Then $F^{\prime}$ forms a tree which is isomorphic to $F\setminus\{e^{*}\}$ and the corresponding edges are coloured the same way. As $F\setminus\{e^{*}\}$ is a downward tree of size $b-1$ , this implies that $F^{\prime}$ is also a downward tree of size $b-1$ . Furthermore, between any two vertices of $F^{\prime}$ there exist edges in exactly the same shades as between the corresponding vertices of $F\setminus\{e^{*}\}$ . Finally, every vertex in one of the colours $1,\dots,g-b+1$ has edges in the same shades to the vertices of $F\setminus\{e^{*}\}$ as to the corresponding vertices of $F^{\prime}$ except that the shade of the edge $e^{*}$ is missing between the vertices $v^{*}$ and $w^{\prime}$ (recall that $e^{*}\in F$ is an edge between $v^{*}$ and $w$ ).

Thus, for every partial copy of $H$ in $\mathcal{C}$ containing $F^{\prime}$ we can form a corresponding partial copy of $H$ in $\mathcal{C}$ containing $F\setminus\{e^{*}\}$ but not containing $e^{*}$ , by simply replacing each of the vertices $v^{\prime}_{g-b+2},\dots,v^{\prime}_{g}$ in the image of the partial copy by $v_{g-b+2},\dots,v_{g}$ (and by replacing each edge in the image of the partial copy by an edge of the same shade between the corresponding vertices). This process is bijective and does not change which shades of which colours occur among the edges in the image of the partial copy. It also does not change the weight of the partial copy. Thus, for every $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ , the quantity $\Gamma_{\mathcal{C},F^{\prime}}(j_{1},\dots,j_{g})$ is equal to the sum of the weights of the $(j_{1},\dots,j_{g})$ -coloured partial copies of $H$ in $\mathcal{C}$ that contain $F\setminus\{e^{*}\}$ but not $e^{*}$ . In other words, we have $\Gamma_{\mathcal{C},F^{\prime}}(j_{1},\dots,j_{g})=\Gamma_{\mathcal{C},F\setminus\{e^{*}\}}(j_{1},\dots,j_{g})-\Gamma_{\mathcal{C},F}(j_{1},\dots,j_{g})$ for all $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ .

Since $F^{\prime}$ and $F\setminus\{e^{*}\}$ are downward trees of size $b-1$ , we have $\Gamma_{\mathcal{C},F^{\prime}},\Gamma_{\mathcal{C},F\setminus\{e^{*}\}}\in L$ by the induction assumption. Hence $\Gamma_{\mathcal{C},F}=\Gamma_{\mathcal{C},F\setminus\{e^{*}\}}-\Gamma_{\mathcal{C},F^{\prime}}\in L$ , as desired. ∎

Now, note that each downward tree $F\subseteq E(\mathcal{C})$ of size $g$ contains exactly one edge in each of the colours $1,\dots,g$ . For each $1\leq i\leq g$ , let $j_{i}\in[t_{i}]$ be the shade of the edge of colour $i$ in $F$ . Then for any tuple $(j^{\prime}_{1},\dots,j^{\prime}_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ with $(j^{\prime}_{1},\dots,j^{\prime}_{g})\neq(j_{1},\dots,j_{g})$ we have $\Gamma_{\mathcal{C},F}(j^{\prime}_{1},\dots,j^{\prime}_{g})=0$ , since by definition there cannot be any $(j^{\prime}_{1},\dots,j^{\prime}_{g})$ -coloured partial copies of $H$ containing $F$ . That is to say, $\Gamma_{\mathcal{C},F}\in L$ is a (possibly zero) multiple of the indicator function of $(j_{1},\dots,j_{g})$ . Since the set of indicator functions of each $(j_{1},\dots,j_{g})$ span the space of all functions $[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ , it suffices to prove the following lemma in order to finish the proof of Lemma 8.5.

Lemma 8.9.

For every $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ there exists a downward tree $F$ of size $g$ in $\mathcal{C}$ such that $\Gamma_{\mathcal{C},F}(j_{1},\dots,j_{g})>0$ .

Proof.

Fix $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ . Recall that $\Gamma_{\mathcal{C},F}(j_{1},\dots,j_{g})$ is the sum of the weights of all $(j_{1},\dots,j_{g})$ -coloured partial copies of $H$ in $\mathcal{C}$ whose image contains all edges in $F$ . Since each partial copy of $H$ in $\mathcal{C}$ has positive weight, it suffices to show that there exists a $(j_{1},\dots,j_{g})$ -coloured partial copy of $H$ in $\mathcal{C}$ which contains some downward tree $F$ of size $g$ .

Since $\mathcal{C}$ is complete, we can recursively choose vertices $v_{1},\dots,v_{g}$ in $\mathcal{C}$ such that for each $1\leq i\leq n$ , the vertex $v_{i}$ has colour $i$ and for any $1\leq j<i\leq g$ there are edges of all $t_{j}$ shades of colour $j$ between $v_{j}$ and $v_{i}$ . Also, since $H$ is connected, and has $h\geq g+1$ vertices, we can choose a $g$ -edge subtree $F_{H}$ of $H$ . Let $V^{\prime}\subseteq V(H)$ be the set of vertices of this subtree $F_{H}$ and note that $|V^{\prime}|=g+1$ . Choose one leaf of $F_{H}$ and call it $u$ . Now, define $\phi:V^{\prime}\to V(\mathcal{C})$ by mapping $u$ to the uncoloured vertex of $\mathcal{C}$ and mapping the remaining vertices of $F_{H}$ to $v_{1},\dots,v_{g}$ in order of decreasing distance to $u$ in the tree $F_{H}$ (this means the vertex of $F_{H}$ with maximum distance from $u$ in the tree $F_{H}$ will be mapped to $v_{1}$ , the vertex with second-largest distance to $v_{2}$ , and so on), where we break ties arbitrarily.

Note that the image of $\phi:V^{\prime}\to V(\mathcal{C})$ contains one vertex of each colour (the vertices $v_{1},\dots,v_{g}$ ) and the uncoloured vertex. By the choice of $v_{1},\dots,v_{g}$ , we can extend $\phi$ to a $(j_{1},\dots,j_{g})$ -coloured partial copy of $H$ in $\mathcal{C}$ (we have already defined the way $\phi$ maps the relevant vertices of $H$ , we just need to define the way it maps edges). Let $F=\phi(F_{H})$ be the image of our subtree $F_{H}$ of $H$ . We can check that $F$ is a downward tree of size $g$ . ∎

9 Proofs of Lemmas 6.3 and 7.3

In this section we finally prove Lemmas 6.3 and 7.3, using our new anticoncentration inequality in Section 3 and the functions $\Gamma_{\mathcal{C},e}$ defined in Section 8. In Section 9.1 we make some definitions and state some auxiliary lemmas. Most importantly, we explain how to define cores $\mathcal{C}$ in such a way that the functions $\Gamma_{\mathcal{C},e}$ represent the typical effects of changing the status of certain edges. In Sections 9.2 and 9.3 we prove the auxiliary lemmas, and we put everything together in Section 9.4.

9.1 Preparations

First, we define functions $\kappa_{H}(\mathcal{G},G_{0},\cdot,u,v)$ , measuring the change to $\psi_{H}(\mathcal{G},G_{0},\cdot)$ that results from changing an edge $uv$ in $G_{0}$ . In our proof of Lemma 7.3, taking $u,v\in\operatorname{U}(\mathcal{G})$ , these functions will correspond to the $\Delta_{i}\boldsymbol{f}$ that appear in the statement of Theorem 3.1. Recall the definition of the graph $\mathcal{G}(G_{0},j_{1},\dots,j_{g})$ from Definition 4.5.

Definition 9.1.

Let $\mathcal{G}$ be a colour system with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g})$ . Then, given two distinct vertices $u$ and $v$ in $\mathcal{G}$ , a $g$ -tuple $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g}]$ , and a graph $G_{0}$ on the vertex set $\operatorname{U}(\mathcal{G})$ , let $\kappa_{H}(\mathcal{G},G_{0},j_{1},\dots,j_{g},u,v)$ be the number of labelled copies of $H$ in the graph $\mathcal{G}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ which use the edge $uv$ as well as at least one vertex of each of the $g$ colours. (Here, $\mathcal{G}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ is the graph obtained from $\mathcal{G}(G_{0},j_{1},\dots,j_{g})$ by adding the edge $uv$ if this edge is not already present.)

Next, we need a similar definition for the proof of Lemma 6.3. Recall that Lemma 6.3 concerns functions $\mu_{\mathcal{G},\mathcal{S}}$ , which are obtained by averaging functions of the form $\psi_{H}(\mathcal{G}_{\mathcal{S}},G_{0},\cdot)$ over $G_{0}$ . We will be interested in the effects on $\mu_{\mathcal{G},\mathcal{S}}$ of adding or removing vertices from the “neighbourhood” sets in $\mathcal{S}$ , which is equivalent to changing the status of edges incident to one of the $a_{g}$ vertices of colour $g$ . We define functions $\nu_{\mathcal{G},\mathcal{S},u,v}$ (where $u$ is an uncoloured vertex and $v$ is a vertex of colour $g$ ) to measure the average effects of such changes.

Definition 9.2.

Given a restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ where $1\leq g\leq h-1$ , an outcome of the random collection of sets $\mathcal{S}$ in Definition 4.8, a vertex $u\in\operatorname{U}(\mathcal{G})$ , and a vertex $v$ of colour $g$ in $\mathcal{G}$ , let $\nu_{\mathcal{G},\mathcal{S},u,v}:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ be the function given by

[TABLE]

for all $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ . Here, $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ is a random graph on the vertex set $\operatorname{U}(\mathcal{G})$ .

Now, we want to show that the typical values of the $\nu_{\mathcal{G},\mathcal{S},u,v}$ and $\kappa_{H}(\mathcal{G},G_{0},\cdot,u,v)$ can be expressed in terms of functions $\Gamma_{\mathcal{C},e}$ . First, we consider $\nu_{\mathcal{G},\mathcal{S},u,v}$ , for the proof of Lemma 6.3. It will suffice to restrict our attention to the cases where $u$ comes from a very “rich” subset of the uncoloured vertices.

Definition 9.3.

For a restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ , let $\operatorname{U^{*}}(\mathcal{G})$ be the set of all uncoloured vertices in $\mathcal{G}$ which are connected to all vertices of the colours $1,\dots,g-1$ in all possible shades of these colours.

When a general position assumption is satisfied, $\operatorname{U^{*}}(\mathcal{G})$ has linear size (by Lemma 5.1), as follows.

Fact 9.4.

Fix integers $g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}\geq 1$ . If $\mathcal{G}$ is an essentially $p$ -general restricted colour system of order $n$ which has parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ then $|\operatorname{U^{*}}(\mathcal{G})|=\Omega(n)$ .

We remind the reader that (as in the rest of this paper) the asymptotics in 9.4 are as $n\to\infty$ , while $p$ and the parameters of the restricted colour system are treated as fixed constants for all asymptotic notation. Now, the relevant core for Lemma 6.3 is as follows.

Definition 9.5.

Given a restricted colour system $\mathcal{G}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ , we can obtain a core $\mathcal{C}$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ as follows. Consider all coloured vertices of $\mathcal{G}$ together with one additional uncoloured vertex which we connect to all coloured vertices by edges in all possible shades. We call $\mathcal{C}$ the core of the restricted colour system $\mathcal{G}$ .

Note that if a restricted colour system $\mathcal{G}$ is complete, then its core is also complete (recall that being complete only depends on the edges between the coloured vertices). The following lemma gives the connection between the functions $\nu_{\mathcal{G},\mathcal{S},u,v}$ and the functions $\Gamma_{\mathcal{C},e}$ . It will be proved in Section 9.2.

Lemma 9.6.

Fix integers $1\leq g\leq h-1$ and $a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}\geq 1$ . Let $\mathcal{G}$ be an essentially weakly $p$ -general restricted colour system of order $n$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . Furthermore, let $\mathcal{S}$ be an outcome of the random collection of sets $\mathcal{S}$ in Definition 4.8, let $u\in\operatorname{U^{*}}(\mathcal{G})$ and let $v$ be a vertex of colour $g$ in $\mathcal{G}$ . Finally, let $\mathcal{C}$ be the core of the restricted colour system $\mathcal{G}$ , and let $e$ be the unique edge from $v$ to the uncoloured vertex in $\mathcal{C}$ (recall that $v$ is a vertex of colour $g$ in $\mathcal{C}$ , and that the colour $g$ only has one shade). Then, if the colour system $\mathcal{G_{S}}$ is $p$ -general, we have

[TABLE]

Next, we turn to the functions $\kappa_{H}(\mathcal{G},G_{0},\cdot,u,v)$ , for the proof of Lemma 7.3. For this, we consider cores of a different type.

Definition 9.7.

Given a colour system $\mathcal{G}$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ , define its extended core to be the core with parameters $(g,a_{1},\dots,a_{g-1},2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}},t_{1},\dots,t_{g-1})$ obtained as follows. Start with all coloured vertices of $\mathcal{G}$ . Now, for each possible choice of subsets $I_{v}\subseteq[t_{j}]$ for each $1\leq j\leq g-1$ and each vertex $v$ of colour $j$ , add a vertex of colour $g$ which is connected to all the vertices $v$ of colours $1,\dots,g-1$ with edges of exactly the shades given by the set $I_{v}$ . Finally, add one uncoloured vertex and connect it to all coloured vertices by edges in all possible shades (including exactly one shade of colour $g$ ).

Note that in Definition 9.7, there are precisely $(2^{t_{1}})^{a_{1}}\dotsm(2^{t_{g-1}})^{a_{g-1}}=2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ different choices for all the subsets $I_{v}\subseteq[t_{j}]$ . Thus, $2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ vertices of colour $g$ get added and the resulting core indeed has parameters $(g,a_{1},\dots,a_{g-1},2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}},t_{1},\dots,t_{g-1})$ . Furthermore, note that if the colour system $\mathcal{G}$ is complete, then its extended core is a complete core.

In a similar way to Lemma 9.6, for the proof of Lemma 7.3 it will suffice to restrict our attention to those $\kappa_{H}(\mathcal{G},G_{0},\cdot,u,v)$ where $u$ and $v$ belong to certain special sets of uncoloured vertices.

Definition 9.8.

Let $\mathcal{G}$ be a colour system with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ and let $\mathcal{C}$ be the extended core of $\mathcal{G}$ . Let $E^{g}(\mathcal{C})$ be the set of edges $e$ connecting the uncoloured vertex in $\mathcal{C}$ to some vertex $w$ of colour $g$ in $\mathcal{C}$ (such an edge is uniquely determined by $w$ because colour $g$ only has one shade). For each such $e\in E^{g}(\mathcal{C})$ , we define the subset $U_{e}\subseteq\operatorname{U}(\mathcal{G})$ as follows. Let $U_{e}$ consist of all those uncoloured vertices $v$ in $\mathcal{G}$ such that $v$ is connected to all vertices of $\mathcal{G}$ of colours $1,\dots,g-1$ in precisely the same shades of the corresponding colours in which the vertex $w$ is connected to these vertices in $\mathcal{C}$ . Also, let $U_{*}\subseteq\operatorname{U}(\mathcal{G})$ be the set of all uncoloured vertices $v$ in $\mathcal{G}$ which are connected to all vertices of $\mathcal{G}$ of colours $1,\dots,g-1$ in all possible shades of these colours.

Note that $U_{*}$ is a special case of $U_{e}$ , for the edge $e\in E^{g}(\mathcal{C})$ connecting the uncoloured vertex of $\mathcal{C}$ to the unique vertex of colour $g$ in $\mathcal{C}$ which is connected to all vertices of colours $1,\dots,g-1$ in all possible shades of these colours. Also note that the $2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ sets $U_{e}$ , for $e\in E^{g}(\mathcal{C})$ , form a partition of $\operatorname{U}(\mathcal{G})$ . We will need a counterpart of 9.4 (again a simple consequence of Lemma 5.1): when a general position assumption is satisfied, each $U_{e}(\mathcal{G})$ has linear size, as follows.

Fact 9.9.

Fix integers $g,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}\geq 1$ . Let $\mathcal{G}$ be a weakly $p$ -general colour system of order $n$ with parameters $(g-1,a_{1},\dots,\allowbreak a_{g-1},\allowbreak t_{1},\dots,t_{g-1})$ , and let $\mathcal{C}$ be the extended core of $\mathcal{G}$ . Then for every $e\in E^{g}(\mathcal{C})$ , we have $|U_{e}|=\Omega(n)$ . In particular, $|U_{*}|=\Omega(n)$ .

The next lemma will be proved in Section 9.3.

Lemma 9.10.

Fix integers $1\leq g\leq h-1$ and $a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}\geq 1$ . Let $\mathcal{G}$ be a weakly $p$ -general colour system of order $n$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ , and let $\mathcal{C}$ be the extended core of $\mathcal{G}$ . Consider an edge $e\in E^{g}(\mathcal{C})$ , and consider the sets $U_{e}\subseteq\operatorname{U}(\mathcal{G})$ and $U_{*}\subseteq\operatorname{U}(\mathcal{G})$ as defined as in Definition 9.8. Then for any distinct vertices $u\in U_{*}$ and $v\in U_{e}$ the following holds. If we choose a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ on the vertex set $\operatorname{U}(\mathcal{G})$ , then with probability $1-n^{-\omega(1)}$ we have

[TABLE]

9.2 Proof of Lemma 9.6

In this subsection we prove Lemma 9.6. Throughout the subsection, fix integers $1\leq g\leq h-1$ and $a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}\geq 1$ (in order to prove Lemma 9.6 for these values). For all asymptotic notation, these fixed values are treated as constants.

As in the statement of Lemma 9.6, let $\mathcal{G}$ be an essentially weakly $p$ -general restricted colour system of order $n$ with parameters $(g,a_{1},\dots,a_{g},t_{1},\dots,t_{g-1})$ . Let $\mathcal{S}$ be an outcome of the random collection of sets $\mathcal{S}$ in Definition 4.8 such that $\mathcal{G_{S}}$ is $p$ -general, let $u\in\operatorname{U^{*}}(\mathcal{G})$ and let $v$ be a vertex of colour $g$ in $\mathcal{G}$ . Finally, let $\mathcal{C}$ be the core of the restricted colour system $\mathcal{G}$ , and let $e\in E^{g}(\mathcal{C})$ be the unique edge from $v$ to the uncoloured vertex in $\mathcal{C}$ . We need to prove that

[TABLE]

Now, let us define slightly modified versions of $\kappa_{H}$ and the $\nu_{\mathcal{G},\mathcal{S},u,v}$ , that are easier to work with. For any outcome of the random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ and any $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ , let $\kappa_{H}^{\prime}(\mathcal{G},G_{0},j_{1},\dots,j_{g},u,v)$ be the number of labelled copies of $H$ in the graph $\mathcal{G}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ which use the edge $uv$ and which use exactly one vertex of each of the $g$ colours (for $\kappa_{H}$ we considered the number of copies of $H$ which use at least one vertex of each colour). Then, we define $\nu_{\mathcal{G},\mathcal{S},u,v}^{\prime}:[t_{1}]\times\dots\times[t_{g-1}]\times[1]\to\mathbb{R}$ analoguously to $\nu_{\mathcal{G},\mathcal{S},u,v}$ :

[TABLE]

for all $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ .

Note that for any outcome of the random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ and any $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ the difference $\kappa_{H}(\mathcal{G},G_{0},j_{1},\dots,j_{g},u,v)-\kappa_{H}^{\prime}(\mathcal{G},G_{0},j_{1},\dots,j_{g},u,v)$ is precisely the number of labelled copies of $H$ in the graph $\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ which use the edge $uv$ as well as at least one vertex of each of the $g$ colours and which use at least two vertices of the same colour. Each such labelled copy has to use at least $g+1$ of the coloured vertices and also the vertex $u\in\operatorname{U^{*}}(\mathcal{G})\subseteq\operatorname{U}(\mathcal{G})$ . Hence it can use at most $h-g-2$ vertices in $\operatorname{U}(\mathcal{G})\!\setminus\!\{u\}$ . Thus, the number of such labelled copies is always at most $h^{g+2}\cdot(a_{1}+\dots+a_{g}+1)^{g+2}\cdot n^{h-g-2}=O(n^{h-g-2})$ . It follows that

[TABLE]

So, to prove Lemma 9.6 it suffices to prove the following lemma.

Lemma 9.11.

We have

[TABLE]

Proof.

We need to show that for every $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ we have

[TABLE]

So let us fix some $(j_{1},\dots,j_{g})\in[t_{1}]\times\dots\times[t_{g-1}]\times[1]$ . For every $i=1,\dots,g$ let us refer to shade $j_{i}$ of colour $i$ as the desired shade of colour $i$ .

Recall that $\nu_{\mathcal{G},\mathcal{S},u,v}^{\prime}(j_{1},\dots,j_{g})=\mathbb{E}_{G_{0}}[\kappa_{H}^{\prime}(\mathcal{G_{S}},G_{0},j_{1},\dots,j_{g},u,v)]$ is the expected number of labelled copies of $H$ in the graph $\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ which use exactly one vertex of each of the $g$ colours and use the edge $uv$ . We organise these copies by how they interact with the coloured vertices, as follows.

Consider any graph homomorphism $\phi:H[V_{\phi}]\to\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ , such that $V_{\phi}$ is a subset of $g+1$ vertices of $H$ and such that the image of $\phi$ contains exactly one vertex of each of the $g$ colours and the edge $uv$ (let $\Phi$ be the set of all such homomorphisms). Let $E_{\phi}$ be the expected number of labelled copies of $H$ in the graph $\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ that extend $\phi$ by mapping the vertices in $V\!\setminus\!V_{\phi}$ into $\operatorname{U}(\mathcal{G})\!\setminus\!\{u\}$ (where the expectation is taken over the random choice of $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ ). Then we have

[TABLE]

Now, since $\mathcal{G_{S}}$ is $p$ -general, we can estimate the $E_{\phi}$ as follows.

Claim 9.12.

For each $\phi\in\Phi$ as above, we have

[TABLE]

Proof.

Let $V_{\text{col}}$ be the set of those vertices $x\in V_{\phi}$ such that $\phi(x)$ is a coloured vertex (i.e. $\phi(x)\neq u$ ). So, $|V_{\text{col}}|=g$ . Recall that $E_{\phi}$ is the expected number of labelled copies of $H$ in the graph $\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ that extend $\phi$ by mapping the vertices in $V\!\setminus\!V_{\phi}$ into $\operatorname{U}(\mathcal{G})\!\setminus\!\{u\}$ . For every vertex $y\in V(H)\!\setminus\!V_{\phi}$ , let $M_{\phi}(y)$ be the set of possible choices for the image of $y$ that are compatible with the map $\phi$ on $H[V_{\phi}]$ . More precisely, $M_{\phi}(y)$ is the set of vertices $w\in\operatorname{U}(\mathcal{G})\!\setminus\!\{u\}$ such that for every neighbour $x\in V_{\text{col}}$ of $y$ , the vertex $w$ is connected to $\phi(x)$ in the desired shade of the colour of the vertex $\phi(x)$ . Let $N$ be the number of $(h-g-1)$ -tuples in $\prod_{y\in V(H)\!\setminus\!V_{\phi}}M_{\phi}(y)$ whose vertices are distinct (that is, the number of ways to choose a distinct vertex from each $M_{\phi}(y)$ ). Then $N=\prod_{y\in V(H)\!\setminus\!V_{\phi}}|M_{\phi}(y)|+O(n^{h-g-2})$ , and

[TABLE]

Indeed, if we choose possible images for all the vertices $y\in V(H)\!\setminus\!V_{\phi}$ (there are $N$ such choices), then each of the $e_{H}(V(H)\!\setminus\!V_{\text{col}})$ edges of $H$ inside $V(H)\!\setminus\!V_{\text{col}}$ needs to be mapped to an edge of $G_{0}\sim\mathbb{G}(|\operatorname{U}(\mathcal{G})|,p)$ and the probability for this to happen is $p^{e_{H}(V(H)\!\setminus\!V_{\text{col}})}$ .

Now, the sizes of the $M_{\phi}(y)$ are dictated by our assumption that $\mathcal{G_{S}}$ is $p$ -general. Fix a vertex $y\in V(H)\!\setminus\!V_{\phi}$ and let $x_{1},\dots,x_{k}$ be its neighbours in $V_{\text{col}}$ . Let $N_{1},\dots,N_{k}\subseteq\operatorname{U}(\mathcal{G})$ be the neighbourhoods of the vertices $\phi(x_{1}),\dots,\phi(x_{k})$ in $\operatorname{U}(\mathcal{G})$ in the desired shades of the colours of $\phi(x_{1}),\dots,\phi(x_{k})$ , respectively. Then the set of possible choices for the image of $y$ is $N_{1}\cap\dots\cap N_{k}\!\setminus\!\{u\}$ . So, $|M_{\phi}(y)|$ differs from $\mathopen{}\mathclose{{}\left|N_{1}\cap\dots\cap N_{k}}\right|$ by at most 1. Now, if we consider all the neighbourhoods in $\operatorname{U}(\mathcal{G})$ of all vertices of colours $1,\dots,g$ in $\mathcal{G_{S}}$ in all the respective shades of these colours, then these are $a_{1}t_{1}+\dots+a_{g-1}t_{g-1}+a_{g}$ subsets of the set $\operatorname{U}(\mathcal{G})$ in $(p,3^{g})$ -general position (this is because $\mathcal{G_{S}}$ is by assumption a $p$ -general colour system). Thus, by Lemma 5.1 we have

[TABLE]

Recall that $|\operatorname{U}(\mathcal{G})|=n-a_{1}-\dots-a_{g}=n-O(1)$ , and that $k$ was the number of neighbours in of $y$ in $V_{\text{col}}$ , so

[TABLE]

or equivalently

[TABLE]

Finally, observe that $e_{H}(V(H)\!\setminus\!V_{\text{col}})$ , plus the sum of all the $e_{H}(y,V_{\text{col}})$ , for $y\in V(H)\!\setminus\!V_{\phi}$ , is equal to $e(H)-e_{H}(V_{\phi})$ . Indeed, since $V_{\phi}$ and $V(H)\!\setminus\!V_{\text{col}}$ intersect in only one vertex $z$ , every edge of $H$ is either between two vertices of $V_{\phi}$ , two vertices of $V(H)\!\setminus\!V_{\text{col}}$ , or between a vertex of $V_{\phi}\!\setminus\!\{z\}=V_{\text{col}}$ and a vertex of $(V(H)\!\setminus\!V_{\text{col}})\!\setminus\!\{z\}=V(H)\!\setminus\!V_{\phi}$ . From Equation 9.3 we therefore conclude that

[TABLE]

which is equivalent to the desired bound. ∎

Now, the sum in Equation 9.2 is only over $|\Phi|\leq h^{g+1}a_{1}\dots a_{g}=O(1)$ choices of $\phi$ , so Claim 9.12 implies that

[TABLE]

Finally, recall that $\Phi$ is the set of homomorphisms of the form $\phi:H[V_{\phi}]\to\mathcal{G_{S}}(G_{0},j_{1},\dots,j_{g})+\{uv\}$ , with $|V_{\phi}|=g+1$ , such that the image of $\phi$ contains exactly one vertex of each of the $g$ colours and the edge $uv$ . The core $\mathcal{C}$ of the restricted colour system $\mathcal{G}$ was defined (in Definition 9.5) in such a way that there is a bijective correspondence between $\Phi$ and the set of $(j_{1},\dots,j_{g})$ -coloured partial copies $\phi^{*}$ of $H$ in $\mathcal{C}$ which contain the edge $e$ . Recall that $\Gamma_{\mathcal{C},e}(j_{1},\dots,j_{g})$ was defined (in Definition 8.4) as the sum of the weights of all $(j_{1},\dots,j_{g})$ -coloured partial copies whose image contains $e$ , and the weight of a partial copy $\phi^{*}:H[V^{\prime}]\to\mathcal{C}$ was defined to be $p^{e(H)-e_{H}(V^{\prime})}$ . So,

[TABLE]

The desired bound Equation 9.1 follows. ∎

9.3 Proof of Lemma 9.10

In this subsection we deduce Lemma 9.10 from Lemma 9.6. Throughout the subsection, fix integers $1\leq g\leq h-1$ and $a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}\geq 1$ (in order to prove Lemma 9.10 for these values). For all asymptotic notation, these fixed values are treated as constants.

Let $\mathcal{G}$ be a weakly $p$ -general colour system of order $n$ with parameters $(g-1,a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1})$ , let $\mathcal{C}$ be the extended core of $\mathcal{G}$ , and consider some $e\in E^{g}(\mathcal{C})$ . Fix distinct vertices $u\in U_{*}$ and $v\in U_{e}$ . We need to show that for a random graph $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ , with probability $1-n^{-\omega(1)}$ we have

[TABLE]

for all $(j_{1},\dots,j_{g-1})\in[t_{1}]\times\dots\times[t_{g-1}]$ . For the rest of the proof fix some $(j_{1},\dots,j_{g-1})\in[t_{1}]\times\dots\times[t_{g-1}]$ : we will show that Equation 9.4 holds with probability $1-n^{-\omega(1)}$ (then the desired result will follow, taking a union bound over all choices of $(j_{1},\dots,j_{g-1})$ ).

By 9.9, each of the $2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ disjoint sets $U_{f}$ has size $\Omega(n)\geq 2$ . Let $Z$ be a set containing one representative from each $U_{f}$ , taking $v\in U_{e}$ , but taking some vertex other than $u$ in $U_{*}$ . If we imagine that the vertices of $Z$ are coloured with colour $g$ , then, by our choice of $Z$ , the coloured vertices of $\mathcal{G}$ together with $Z$ form a colour system which looks the same as the extended core $\mathcal{C}$ of $\mathcal{G}$ except that the uncoloured vertex of $\mathcal{C}$ is missing.

Now, for the rest of the proof, we condition on some outcome of the induced subgraph $G_{0}[Z]$ on the vertices in $Z$ . To apply Lemma 9.6, we define a restricted colour system $\mathcal{G}^{\prime}$ of order $n$ with parameters $(g,a_{1},\dots,a_{g-1},2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}},t_{1},\dots,t_{g-1})$ by starting with $\mathcal{G}$ , colouring the vertices in $Z$ with colour $g$ , and including all edges of our conditioned outcome of $G_{0}[Z]$ in a single shade of colour $g$ . By construction, the core $\mathcal{C}^{\prime}$ of the restricted colour system $\mathcal{G}^{\prime}$ is almost isomorphic to $\mathcal{C}$ ; the only difference is that $\mathcal{C}^{\prime}$ has the edges of $G_{0}[Z]$ between the vertices of colour $g$ , whereas $\mathcal{C}$ has no edges of colour $g$ . (To be precise, there is a colour/shade-preserving graph isomorphism between $\mathcal{C}$ and $\mathcal{C}^{\prime}-E(G_{0}[Z])$ ).

Since we chose $Z$ such that $v\in Z$ and $u\not\in Z$ , we have $u\in\operatorname{U}(\mathcal{G}^{\prime})$ (that is, $u$ is uncoloured in $\mathcal{G}^{\prime}$ ), and $v$ has colour $g$ in $\mathcal{G}^{\prime}$ . Recall that $u\in U_{*}$ , meaning that $u$ is connected to all vertices of colours $1,\dots,g-1$ in all possible shades of these colours. This implies $u\in\operatorname{U^{*}}(\mathcal{G}^{\prime})$ . Now, let $e^{\prime}\in E^{g}(\mathcal{C}^{\prime})$ be the unique edge in the core $\mathcal{C}^{\prime}$ between $v$ and the uncoloured vertex of $\mathcal{C}^{\prime}$ . As $v\in U_{e}$ , this edge $e^{\prime}$ corresponds to the edge $e$ in $\mathcal{C}$ under the isomorphism in the previous paragraph. Note that the functions $\Gamma_{\mathcal{C},e}$ do not actually depend on the edges between the vertices of colour $g$ in $\mathcal{C}$ (since the partial copies of $H$ in $\mathcal{C}$ use exactly one vertex of colour $g$ ). Thus, we have $\Gamma_{\mathcal{C},e}=\Gamma_{\mathcal{C}^{\prime},e^{\prime}}$ .

We have conditioned on an outcome of $G_{0}[Z]$ . For each $z\in Z$ , let $S_{z}=N_{G_{0}}(z)\cap\operatorname{U}(\mathcal{G}^{\prime})$ be the (random) set of neighbours of $z$ in $\operatorname{U}(\mathcal{G}^{\prime})=\operatorname{U}(\mathcal{G})\!\setminus\!Z$ , and let $G_{0}^{-}=G_{0}[\operatorname{U}(\mathcal{G}^{\prime})]\sim\mathbb{G}(\operatorname{U}(\mathcal{G}^{\prime}),p)$ be the induced subgraph on $\operatorname{U}(\mathcal{G}^{\prime})$ . Then, with $\mathcal{S}=(S_{z})_{z\in Z}$ we have $\mathcal{G}(G_{0},j_{1},\dots,j_{g-1})=\mathcal{G}^{\prime}_{\mathcal{S}}(G_{0}^{-},j_{1},\dots,j_{g},1)$ in the notation of Definition 4.8. It follows that $\kappa_{H}(\mathcal{G},G_{0},j_{1},\dots,j_{g-1},u,v)=\kappa_{H}(\mathcal{G}^{\prime}_{\mathcal{S}},G_{0}^{-},j_{1},\dots,j_{g-1},1,u,v)$ , because every labelled copy of $H$ using the edge $uv$ as well as at least one vertex of each of the colours $1,\dots,g-1$ automatically also uses a vertex of colour $g$ (namely $v$ ). Thus, Equation 9.4 is equivalent to the inequality

[TABLE]

where $G_{0}^{-}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}^{\prime}),p)$ , and $\mathcal{S}=(S_{z})_{z\in Z}$ is a collection of random sets with respect to the restricted colour system $\mathcal{G}^{\prime}$ as in Definition 4.8.

Note that $\mathcal{G}^{\prime}$ is essentially weakly $p$ -general, due to the way it was defined in terms of $\mathcal{G}$ . So, by Lemma 5.5, $\mathcal{G}^{\prime}_{\mathcal{S}}$ is $p$ -general with probability at least $1-n^{-\omega(1)}$ , and if $\mathcal{G}^{\prime}_{\mathcal{S}}$ is $p$ -general then by Lemma 9.6 we have $\Gamma_{\mathcal{C}^{\prime},e^{\prime}}(j_{1},\dots,j_{g-1},1)n^{h-g-1}=\nu_{\mathcal{G}^{\prime},\mathcal{S},u,v}(j_{1},\dots,j_{g-1},1)+O(\log n\cdot n^{h-g-(3/2)})$ . So, to conclude the proof of Lemma 9.10, it suffices to prove the following claim.

Claim 9.13.

With probability $1-n^{-\omega(1)}$ we have

[TABLE]

Proof.

Condition on a fixed outcome of $\mathcal{S}$ (so that only $G_{0}^{-}$ remains random). Recall that by Definition 9.2 we have $\nu_{\mathcal{G}^{\prime},\mathcal{S},u,v}(j_{1},\dots,j_{g-1},1)=\mathbb{E}_{G_{0}^{-}}[\kappa_{H}(\mathcal{G_{S}^{\prime}},G_{0}^{-},j_{1},\dots,j_{g-1},1,u,v)]$ . Consider the vertex-exposure martingale for $G_{0}^{-}$ (with respect to $\kappa_{H}(\mathcal{G}^{\prime}_{\mathcal{S}},G_{0}^{-},j_{1},\dots,j_{g-1},1,u,v)$ ), where we fix an ordering of $\operatorname{U}(\mathcal{G}^{\prime})$ (ending with $u$ ) and at each step we consider the next vertex in our ordering and expose all the edges of $G_{0}^{-}$ incident to that vertex which have not yet been exposed. Changing the status of edges adjacent to a single vertex in $\operatorname{U}(\mathcal{G}^{\prime})\!\setminus\!\{u\}$ changes the value of $\kappa_{H}(\mathcal{G}^{\prime}_{\mathcal{S}},G_{0}^{-},j_{1},\dots,j_{g-1},1,u,v)$ by at most $h^{g+2}\cdot a_{1}\dotsm a_{g-1}\cdot 2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}\cdot n^{h-g-2}=O(n^{h-g-2})$ . This is due to the fact that there can be at most $h^{g+2}\cdot a_{1}\dotsm a_{g-1}\cdot 2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}\cdot n^{h-g-2}$ different labelled copies of $H$ in the $n$ -vertex graph $\mathcal{G_{S}}(G_{0}^{-},j_{1},\dots,j_{g})+\{uv\}$ which use the edge $uv$ as well as at least one vertex of each of the $g$ colours and the exposed vertex (as both $u$ and the exposed vertex are uncoloured). Thus, by the Azuma–Hoeffding inequality the probability that Equation 9.6 fails to hold is at most

[TABLE]

as desired. ∎

9.4 Putting everything together

Proof of Lemma 7.3.

Recall that the statement of Lemma 7.3 is for fixed integers $1\leq g\leq h-1$ and $a_{1},\dots,a_{g-1},t_{1},\dots,t_{g-1}\geq 1$ (which were fixed throughout Section 7), and recall that $T=t_{1}\dotsm t_{g-1}$ . Let $\mathcal{G}$ and $\lambda$ be as in the statement of the lemma, and let $\mathcal{C}$ be the extended core of $\mathcal{G}$ (which is a complete core). Let $N=\binom{|\operatorname{U}(\mathcal{G})|}{2}=\Theta(n^{2})$ , so that the random choice of $G_{0}\sim\mathbb{G}(\operatorname{U}(\mathcal{G}),p)$ can be encoded by a Bernoulli sequence $\boldsymbol{\xi}\sim\operatorname{Ber}(p)^{N}$ , with one random bit for each of the $N$ possible edges of $G_{0}$ . Abusing notation slightly, we identify the integers $1,\dots,N$ with pairs of vertices in $\operatorname{U}(\mathcal{G})$ , so that we may write $\xi_{\{u,v\}}$ to indicate the random bit that encodes the presence of the edge $\{u,v\}$ .

Now, abusing notation, we index the coordinates of $\mathbb{R}^{T}$ by tuples in $[t_{1}]\times\dots,\times[t_{g-1}]$ (so that we may talk about the $(j_{1},\dots,j_{g-1})$ -coordinate of a vector in $\mathbb{R}^{T}$ ). Let $\boldsymbol{f}:\{0,1\}^{N}\to\mathbb{R}^{T}$ be the vector-valued function defined such that, for $\boldsymbol{\xi}\in\{0,1\}^{N}$ corresponding to a graph $G_{0}$ , the $(j_{1},\dots,j_{g-1})$ -coordinate of $\boldsymbol{f}(\boldsymbol{\xi})$ is $\psi_{H}(\mathcal{G},G_{0},j_{1},\dots,j_{g-1})$ . With this definition, and the notation of Theorem 3.1, the random vector $\Delta_{\{u,v\}}\boldsymbol{f}(\boldsymbol{\xi})$ corresponds to the function $\kappa_{H}(\mathcal{G},G_{0},\cdot,1,u,v)$ .

The plan is to now apply Theorem 3.1 with $d=T$ and $m=2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}}$ and with the $\Gamma_{\mathcal{C},e}$ taking the role of the vectors $\boldsymbol{v}_{1},\dots,\boldsymbol{v}_{m}$ . For each edge $e\in E^{g}(\mathcal{C})$ , let $\boldsymbol{\gamma}_{e}\in\mathbb{R}^{T}$ be the vector corresponding to the function $\Gamma_{\mathcal{C},e}(\cdot,1)$ , and let $\boldsymbol{x}\in\mathbb{R}^{T}$ be the vector corresponding to the function $\lambda(\cdot,1)$ . By 9.9, there is some $\varepsilon=\Omega(1)$ such that for each edge $e\in E^{g}(\mathcal{C})$ , there are $\Omega(n^{2})\geq\varepsilon N$ pairs of vertices $\{u,v\}$ with $u\in U_{*},v\in U_{e}$ . Let $I_{e}$ be the set of these pairs $\{u,v\}$ (observe that all the $I_{e}$ are disjoint). By Lemma 9.10, for each $\{u,v\}\in I_{e}$ we have $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{\{u,v\}}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-s\boldsymbol{\gamma}_{e}}\right\|_{\infty}\geq r}\right)\leq n^{-\omega(1)}$ for $s=n^{h-g-1}$ and some $r=\Theta(n^{h-g-3/2}\log n)$ . Note that $r\sqrt{N\log N}\geq s$ , and by Lemma 8.5, the vectors $\boldsymbol{\gamma}_{e}$ span $\mathbb{R}^{T}$ . We can now apply Theorem 3.1 to obtain

[TABLE]

(The implied constants in the above asymptotic notation may a priori depend on $\mathcal{C}$ , but note that there are only finitely many possibilities for a core with parameters $(g,a_{1},\dots,2^{a_{1}t_{1}+\dots+a_{g-1}t_{g-1}},t_{1},\dots,t_{g-1})$ ). Finally, to conclude the proof we recall that $\mathopen{}\mathclose{{}\left\|\psi_{H}(\mathcal{G},G_{0},\cdot)-\lambda}\right\|_{\infty}=\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-\boldsymbol{x}}\right\|_{\infty}$ and observe that $r\sqrt{N\log N}=\Theta(n^{h-g-(1/2)}(\log n)^{3/2})\geq n^{h-g-(1/2)}\log n$ for large $n$ . ∎

Proof of Lemma 6.3.

This proof is very similar to the proof of Lemma 7.3. Recall that the statement of Lemma 6.3 is for fixed integers $1\leq g\leq h-1$ and $a_{1},\dots,a_{g},t_{1},\dots,t_{g-1}\geq 1$ (which were fixed throughout Section 6), and recall that $T=t_{1}\dotsm t_{g-1}$ . Let $\mathcal{G}$ and $\lambda$ be as in the statement of the lemma, and let $\mathcal{C}$ be the core of $\mathcal{G}$ . Let $N=a_{g}\cdot|\operatorname{U}(\mathcal{G})|=\Theta(n)$ , so that a random choice of $\mathcal{S}$ as in Definition 4.8 can be encoded by a Bernoulli sequence $\boldsymbol{\xi}\in\operatorname{Ber}(p)^{N}$ , with one random bit for each potential element in each $S_{v}\in\mathcal{S}$ . Abusing notation slightly, we identify $1,\dots,N$ with ordered pairs of vertices: for $u\in\operatorname{U}(\mathcal{G})$ and a vertex $v$ of colour $g$ we write $\xi_{(u,v)}$ for the random bit that encodes the presence of $u$ in $S_{v}$ .

Let $\boldsymbol{f}:\{0,1\}^{N}\to\mathbb{R}^{T}$ be the vector-valued function (with coordinates indexed by $[t_{1}]\times\dots\times[t_{g-1}]$ ) defined such that, for $\boldsymbol{\xi}\in\{0,1\}^{N}$ corresponding to an outcome of $\mathcal{S}$ , the $(j_{1},\dots,j_{g-1})$ -coordinate of $\boldsymbol{f}(\boldsymbol{\xi})$ is $\mu_{\mathcal{G},\mathcal{S}}(j_{1},\dots,j_{g-1},1)$ . Then, $\Delta_{(u,v)}\boldsymbol{f}(\boldsymbol{\xi})$ corresponds to the function $\nu_{\mathcal{G},\mathcal{S},u,v}(\cdot,1)$ . For each $e\in E^{g}(\mathcal{C})$ , let $\boldsymbol{\gamma}_{e}\in\mathbb{R}^{T}$ be the vector corresponding to $\Gamma_{\mathcal{C},e}(\cdot,1)$ , and let $\boldsymbol{x}\in\mathbb{R}^{T}$ be the vector corresponding to $\lambda(\cdot,1)$ .

By 9.4, there is some $\varepsilon=\Omega(1)$ such that $|\operatorname{U^{*}}(\mathcal{G})|\geq\varepsilon n$ . For each edge $e\in E^{g}(\mathcal{C})$ between the uncoloured vertex of $\mathcal{C}$ and some vertex $v$ of colour $g$ , let $I_{e}=\operatorname{U^{*}}(\mathcal{G})\times\{v\}$ . By Lemma 5.5 the colour system $\mathcal{G_{S}}$ is $p$ -general with probability $1-n^{-\omega(1)}$ , in which case, by Lemma 9.6, for each $(u,v)\in I_{e}$ we have $\mathopen{}\mathclose{{}\left\|\Delta_{(u,v)}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-s\boldsymbol{\gamma}_{e}}\right\|_{\infty}\leq r$ for $s=n^{h-g-1}$ and some $r=\Theta(n^{h-g-(3/2)}\log n)$ . Also, by Lemma 8.5, the vectors $\boldsymbol{\gamma}_{e}$ span $\mathbb{R}^{T}$ .

We can now apply Theorem 3.1 to obtain $\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{N\log N}}\right)\leq n^{-T/2+o(1)}.$ Finally, to conclude the proof we observe that $r\sqrt{N\log N}=\Theta(n^{h-g-1}(\log n)^{3/2})\geq n^{h-g-1}\log n$ for large $n$ , and recall that $\mathopen{}\mathclose{{}\left\|\mu_{\mathcal{G},\mathcal{S}}-\lambda}\right\|_{\infty}=\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-\boldsymbol{x}}\right\|_{\infty}$ . ∎

10 Concluding remarks

We have proved that for connected $H$ and constant $p\in(0,1)$ , we have $\max_{x}\Pr(X_{H}=x)\leq n^{1-v(H)+o(1)}$ . There are several interesting directions for future research. Most obviously, Conjecture 1.1 remains open: for connected $H$ we are still a factor of $n^{o(1)}$ away from an optimal bound, and for disconnected $H$ we do not even have a bound that improves as $H$ grows (the best general bound is $\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|X_{H}-x}\right|\leq n^{v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right)=O\mathopen{}\mathclose{{}\left(1/n}\right)$ , as we mentioned in the introduction). It seems that the ideas in this paper are robust enough to give certain nontrivial bounds (in terms of the size of the largest component of $H$ ) even in the disconnected case, but we have not explored this further.

For certain graphs $H$ , a possible route to a proof of Conjecture 1.1 might be via a local central limit theorem, which one might hope to prove by extending the methods of Gilmer and Kopparty [20], and Berkowitz [6, 7]. Basically, this involves carefully estimating the characteristic function $\varphi(t)=\mathbb{E}e^{itX_{H}}$ , using different arguments for different ranges of $t$ . We remark that $\varphi(1/k)$ is small if the distribution of $X_{H}$ is not too biased mod $k$ , which seems comparable to anticoncentration of $X_{H}$ at “scale” $k$ . So, we wonder whether the ideas in this paper might be helpful for estimating $\varphi$ : recall that our argument proceeds by breaking up $X_{H}$ into a sum of many random variables that fluctuate at different scales. However, we emphasise that local central limit theorems do not seem to be the right path to a proof of Conjecture 1.1 in its full generality: for example, if $H$ is a disjoint union of an edge and a 2-edge path, then the probability that $X_{H}$ is odd is substantially different from the probability that it is even (see [16]), meaning that $X_{H}$ does not obey a local central limit theorem.

Also, let $X_{H}^{\mathrm{hom}}$ be the number of (possibly non-injective) homomorphisms from $H$ into $G\sim\mathbb{G}(n,p)$ . This random variable is very closely related to $X_{H}$ , and we remark that with very minimal changes, one can modify our proof of Theorem 1.2 to prove the corresponding theorem for $X_{H}^{\mathrm{hom}}$ , when $H$ is connected. Interestingly, the homomorphism-counting analogue of Conjecture 1.1 fails dramatically in general: if $H$ is the disjoint union of two copies of a graph $H^{\prime}$ , then $X_{H}^{\mathrm{hom}}=(X_{H^{\prime}}^{\mathrm{hom}})^{2}$ , meaning that $X_{H}$ has the same point probabilities as $X_{H^{\prime}}$ . This means that any proof of Conjecture 1.1 must be sensitive to the difference between subgraph counts and homomorphism counts.

It would also be interesting to consider the “sparse” regime where $p$ is allowed to decay with $n$ . For example, it is known that if $H$ is strictly balanced (see for example [23, Section 3.2]) and $p=p(n)$ is such that $\operatorname{Var}X_{H}=o((\mathbb{E}X_{H})^{2})$ , then $\Pr(X_{H}>0)=1-o(1)$ . Could it be that under these conditions we also have that $\max_{x\in\mathbb{Z}}\Pr(X_{H}=x)=O(1/\sqrt{\operatorname{Var}X_{H}})$ ?

As mentioned in [19] it may also be interesting to study anticoncentration of the number of induced copies $X_{H}^{\prime}$ of a subgraph $H$ in a random graph $\mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right)$ . (This question was also raised by Meka, Nguyen and Vu [30]). The natural analogue of Conjecture 1.1 is that for a fixed graph $H$ and fixed $p\in\mathopen{}\mathclose{{}\left(0,1}\right)$ , we have

[TABLE]

We remark that the behaviour of $\sqrt{\operatorname{Var}\mathopen{}\mathclose{{}\left(X_{H}^{\prime}}\right)}$ is not entirely trivial: for most values of $p$ it has order $\Theta(n^{h-1})$ , but when $p$ is exactly equal to the edge-density of $H$ it may have order $\Theta(n^{h-3/2})$ or $\Theta(n^{h-2})$ (see [23, Theorem 6.42]).

Finally, it would be interesting to prove similar anticoncentration results in other combinatorial settings. One important example is random subsets of the integers (or other groups): for instance, what can we say about anticoncentration of the number of $k$ -term arithmetic progressions in a random subset of $\{1,\dots,n\}$ ? Arithmetic configuration counts have been an interesting analogue to subgraph counts in a number of other settings, for example in the study of large deviations (both fall in the framework of nonlinear large deviations initiated by Chatterjee and Dembo [12]; see for example [22] and the references therein). Another interesting direction of research would be to consider subgraph counts in random $k$ -uniform hypergraphs, or for other random graph models (for example, the uniform distribution $\mathbb{G}(n,m)$ on graphs with a fixed set of $n$ vertices and exactly $m$ edges).

Remark added in proof. While this paper was under review, Sah and Sahwney [36] proved Conjecture 1.1 for connected $H$ (via a local limit theorem) and disproved Conjecture 1.1 in general.

Acknowledgements. We thank the referees for their careful reading, and for many helpful comments.

Bibliography39

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] N. Alon and J. H. Spencer, The probabilistic method , fourth ed., Wiley Series in Discrete Mathematics and Optimization, John Wiley & Sons, Inc., Hoboken, NJ, 2016.
2[2] F. Augeri, Nonlinear large deviation bounds with applications to traces of Wigner matrices and cycles counts in Erdös-Renyi graphs , Ann. Probab., to appear.
3[3] A. D. Barbour, Poisson convergence and random graphs , Math. Proc. Cambridge Philos. Soc. 92 (1982), no. 2, 349–359.
4[4] A. D. Barbour, M. Karoński, and A. Ruciński, A central limit theorem for decomposable random variables with applications to random graphs , J. Combin. Theory Ser. B 47 (1989), no. 2, 125–145.
5[5] A. Basak and R. Basu, Upper tail large deviations of regular subgraph counts in Erdős-Rényi graphs in the full localized regime , ar Xiv preprint ar Xiv:1912.11410 (2019).
6[6] R. Berkowitz, A quantitative local limit theorem for triangles in random graphs , ar Xiv preprint ar Xiv:1610.01281 (2016).
7[7] R. Berkowitz, A local limit theorem for cliques in G ( n , p ) 𝐺 𝑛 𝑝 G(n,p) , ar Xiv preprint ar Xiv:1811.03527 (2018).
8[8] B. B. Bhattacharya, S. Ganguly, E. Lubetzky, and Y. Zhao, Upper tails and independence polynomials in random graphs , Adv. Math. 319 (2017), 313–347.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Anticoncentration for subgraph counts in random graphs

Abstract

1 Introduction

Conjecture 1.1**.**

Theorem 1.2**.**

1.1 Basic definitions and notation

2 Discussion and main ideas of the proof

3 Anticoncentration for “almost-linear” random vectors

Theorem 3.1**.**

Lemma 3.2**.**

Proof of Theorem 3.1.

Claim 3.3**.**

Proof.

4 Colour systems and the induction hypothesis

Definition 4.1**.**

Definition 4.2**.**

Definition 4.3**.**

Definition 4.4**.**

Definition 4.5**.**

Theorem 4.6**.**

Definition 4.7**.**

Definition 4.8**.**

Definition 4.9**.**

Proposition 4.10**.**

5 Sets in general position

Lemma 5.1**.**

Proof.

Lemma 5.2**.**

Proof.

Lemma 5.3**.**

Proof.

Corollary 5.4**.**

Proof.

Lemma 5.5**.**

Proof.

6 Random neighbourhoods: Theorem 4.6 implies Proposition 4.10

6.1 Preparations

Definition 6.1**.**

Lemma 6.2**.**

Proof.

Lemma 6.3**.**

6.2 Joint anticoncentration

Lemma 6.4**.**

Proof.

Claim 6.5**.**

Proof.

Corollary 6.6**.**

Proof.

6.3 Completing the proof of Proposition 4.10

7 Completing the induction step: Proposition 4.10 for ggg implies Theorem 4.6 for g−1g-1g−1

7.1 Preparations

Definition 7.1**.**

Lemma 7.2**.**

Proof.

Lemma 7.3**.**

7.2 Proof of Theorem 4.6 for g−1g-1g−1

Claim 7.4**.**

Proof.

8 Cores and non-degeneracy

Definition 8.1**.**

Definition 8.2**.**

Definition 8.3**.**

Definition 8.4**.**

Lemma 8.5**.**

8.1 Proof of Lemma 8.5

Definition 8.6**.**

Lemma 8.7**.**

Proof.

Lemma 8.8**.**

Proof.

Lemma 8.9**.**

Proof.

9 Proofs of Lemmas 6.3 and 7.3

9.1 Preparations

Conjecture 1.1.

Theorem 1.2.

Theorem 3.1.

Lemma 3.2.

Claim 3.3.

Definition 4.1.

Definition 4.2.

Definition 4.3.

Definition 4.4.

Definition 4.5.

Theorem 4.6.

Definition 4.7.

Definition 4.8.

Definition 4.9.

Proposition 4.10.

Lemma 5.1.

Lemma 5.2.

Lemma 5.3.

Corollary 5.4.

Lemma 5.5.

Definition 6.1.

Lemma 6.2.

Lemma 6.3.

Lemma 6.4.

Claim 6.5.

Corollary 6.6.

7 Completing the induction step: Proposition 4.10 for $g$ implies Theorem 4.6 for $g-1$

Definition 7.1.

Lemma 7.2.

Lemma 7.3.

7.2 Proof of Theorem 4.6 for $g-1$

Claim 7.4.

Definition 8.1.

Definition 8.2.

Definition 8.3.

Definition 8.4.

Lemma 8.5.

Definition 8.6.

Lemma 8.7.

Lemma 8.8.

Lemma 8.9.

Definition 9.1.

Definition 9.2.

Definition 9.3.

Fact 9.4.

Definition 9.5.

Lemma 9.6.

Definition 9.7.

Definition 9.8.

Fact 9.9.

Lemma 9.10.

Lemma 9.11.

Claim 9.12.

Claim 9.13.