Anticoncentration for subgraph counts in random graphs
Jacob Fox, Matthew Kwan, Lisa Sauermann

TL;DR
This paper establishes near-optimal bounds on the probability that the count of a fixed subgraph in a random graph falls into a specific small value, advancing understanding of small-ball probabilities in random graph theory.
Contribution
It introduces a novel anticoncentration inequality for almost linear random vectors and applies it to bound subgraph count probabilities in ErdĆsâRĂ©nyi graphs.
Findings
Proves that for connected graphs, the probability of exactly x copies is at most n^{1-v(H)+o(1)}.
Develops a new anticoncentration inequality for vectors with near-linear behavior.
Provides a method to analyze small-ball probabilities for subgraph counts.
Abstract
Fix a graph and some , and let be the number of copies of in a random graph . Random variables of this form have been intensively studied since the foundational work of Erd\H{o}s and R\'{e}nyi. There has been a great deal of progress over the years on the large-scale behaviour of , but the more challenging problem of understanding the small-ball probabilities has remained poorly understood until now. More precisely, how likely can it be that falls in some small interval or is equal to some particular value? In this paper we prove the almost-optimal result that if is connected then for any we have . Our proof proceeds by iteratively breaking into different components which fluctuate at "different scales", and relies on a new anticoncentration inequality for random vectors that behaveâŠ
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Anticoncentration for subgraph counts in random graphs
Jacob Fox Department of Mathematics, Stanford University, Stanford, CA 94305. Email: [email protected]. Research supported by a Packard Fellowship and by NSF Career Award DMS-1352121. ââ
Matthew Kwan Department of Mathematics, Stanford University, Stanford, CA 94305. Email: [email protected]. Research supported in part by SNSF project 178493. ââ
Lisa Sauermann School of Mathematics, Institute for Advanced Study, Princeton, NJ 08540. Email: [email protected].
Abstract
Fix a graph and some , and let be the number of copies of in a random graph . Random variables of this form have been intensively studied since the foundational work of ErdĆs and RĂ©nyi. There has been a great deal of progress over the years on the large-scale behaviour of , but the more challenging problem of understanding the small-ball probabilities has remained poorly understood until now. More precisely, how likely can it be that falls in some small interval or is equal to some particular value? In this paper we prove the almost-optimal result that if is connected then for any we have . Our proof proceeds by iteratively breaking into different components which fluctuate at âdifferent scalesâ, and relies on a new anticoncentration inequality for random vectors that behave âalmost linearlyâ.
1 Introduction
Let \mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right) be the binomial random graph model, where we fix a set of vertices and include each of the possible edges independently with probability . For graphs and , let X_{H}\mathopen{}\mathclose{{}\left(G}\right) be the number of subgraphs of isomorphic to , so that if G\sim\mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right) then we can interpret as the random variable that counts the number of copies of in a random graph.
These subgraph-counting random variables and their distributions are central objects of study in the theory of random graphs, going back to the foundational work of ErdĆs and RĂ©nyi [17]. Early work [9, 17, 35] concerned* *existence of subgraphs: fixing , for which and is it likely that , and for which and is it likely that ? It turns out that there is a threshold value of (as a decaying function of ) that cleanly separates these two behaviours, and further work [9, 10, 17, 25, 35, 37] focused on investigating the (Poisson-type) distribution of near this threshold.
In this paper, we are more interested in the behaviour far above this existence threshold (when is a constant independent of ). When appropriately normalised, has an asymptotically111All asymptotics, here and for the rest of the paper, are as , while is fixed. normal distribution (this was proved by Nowicki and Wierman [32] and RuciĆski [34], following several results [3, 24, 25] pushing increasingly further past the existence threshold). Further work by Barbour, KaroĆski and RuciĆski [4] provided quantitative bounds on the rate of convergence to the normal distribution. However, this asymptotic normality only characterises the âlarge-scaleâ behaviour of the distribution of , and is basically due to the fact that closely correlates with the number of edges in \mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right).
A more challenging direction of research is to understand âlocalâ aspects of the distributions of these subgraph-counting random variables222We remark that another completely different challenging direction of research is the study of large deviations of subgraph counts in random graphs. Recently, there has been a lot of progress in this area, see for example the monograph [11], the recent papers [2, 5, 8, 13, 22], and the references therein.. In the past decade, there have been a number of advances in this direction. Following work by Loebl, MatouĆĄek and PangrĂĄc [27] for the case where is a triangle, it was proved by Kolaitis and Kopparty [26] (see also [14]) that if we fix some p\in\mathopen{}\mathclose{{}\left(0,1}\right), some prime and some connected graph with at least one edge, then mod has an asymptotically uniform distribution on \mathopen{}\mathclose{{}\left\{0,\dots,q-1}\right\}. More recently, local central limit theorems have begun to emerge, giving asymptotic formulas for the point probabilities \Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right) in terms of a normal density function. Such a theorem was first proved for the case where is a triangle by Gilmer and Kopparty [20] (see also [6]), and this was extended by Berkowitz [7] to the case where is any clique.
In this paper we are concerned with a somewhat looser question: what can be said about the anticoncentration behaviour of ? Roughly speaking, this is asking for uniform upper bounds on the point probabilities \Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right), or more generally on the small ball probabilities \Pr\mathopen{}\mathclose{{}\left(X_{H}\in I}\right), where is an interval of prescribed length. Meka, Nguyen and Vu [30] developed some general polynomial anticoncentration inequalities, and used the polynomial structure of to prove the bound \Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq n^{-1+o\mathopen{}\mathclose{{}\left(1}\right)} for constant p\in\mathopen{}\mathclose{{}\left(0,1}\right) and any that contains at least one edge. In [19] we proposed the following conjecture.
Conjecture 1.1**.**
Fix p\in\mathopen{}\mathclose{{}\left(0,1}\right) and fix a graph with no isolated vertices. Then
[TABLE]
where is the number of vertices of .
The motivation for Conjecture 1.1 is that if is fixed then \operatorname{Var}X_{H}=\Theta\mathopen{}\mathclose{{}\left(n^{2v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right) (see for example [23, Lemma 3.5]), and the aforementioned asymptotic normality therefore implies that is concentrated on an interval of length \Theta\mathopen{}\mathclose{{}\left(n^{v\mathopen{}\mathclose{{}\left(H}\right)-1}}\right). Provided that the distribution of is sufficiently âsmoothâ, we should expect each value in this interval to have comparable probability. Note that this line of reasoning implies that Conjecture 1.1, if true, is best possible: any stronger bound would contradict Chebyshevâs inequality. Also, observe that the assumption that has no isolated vertices is necessary: if is obtained from by removing isolated vertices then is a deterministic multiple of , so inherits its point probabilities.
We also remark that while Meka, Nguyen and Vu were the first to explicitly consider anticoncentration of subgraph counts in general, actually PangrĂĄc, MatouĆĄek and Loebl [27] considered anticoncentration of the triangle-count more than ten years earlier: a primary motivation for their aforementioned work on triangle-counts mod was to show that the point probabilities \Pr\mathopen{}\mathclose{{}\left(X_{K_{3}}=x}\right) are small (they gave a bound of O\mathopen{}\mathclose{{}\left(1/\log n}\right)), which in turn implies that two independent copies of \mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right) are unlikely to have the same Tutte polynomial. In addition, many of the other aforementioned results concerning the distribution of imply anticoncentration bounds: any central limit theorem already implies that \max_{x}\Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)=o\mathopen{}\mathclose{{}\left(1}\right), and one can333The central limit theorem of Barbour, KaroĆski and RuciĆski was not stated in a way that allows one to directly read off estimates for probabilities regarding . But, it is possible to deduce such an estimate with the method of [33, Proposition 1.2.2]. deduce from the quantitative central limit theorem of Barbour, KaroĆski and RuciĆski [4] that \Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|X-x}\right|\leq n^{v\mathopen{}\mathclose{{}\left(H}\right)-3/2}}\right)=O\mathopen{}\mathclose{{}\left(1/\sqrt{n}}\right). The local central limit theorem of Berkowitz [7] definitively settles the matter in the case where is an -vertex clique, in which case it actually gives the asymptotically optimal bound \Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq\mathopen{}\mathclose{{}\left(2\pi\operatorname{Var}X_{H}}\right)^{-1/2}+o\mathopen{}\mathclose{{}\left(n^{1-h}}\right)=O(n^{1-h}).
In [19] we used ideas related to ErdĆsâ combinatorial proof of the ErdĆsâLittlewoodâOfford theorem (see [18]) to give a simple proof of the general bound \Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|X_{H}-x}\right|\leq n^{v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right)=O\mathopen{}\mathclose{{}\left(1/n}\right), and we showed how to extend these methods to prove the sharper bound \Pr\mathopen{}\mathclose{{}\left(X_{K_{h}}=x}\right)=n^{1-h+o\mathopen{}\mathclose{{}\left(1}\right)} in the case where is a clique. In this paper we develop these methods much further, proving an approximate version of Conjecture 1.1 for all connected .
Theorem 1.2**.**
Fix p\in\mathopen{}\mathclose{{}\left(0,1}\right) and fix a connected graph . Then
[TABLE]
Concerning the -term in Theorem 1.2, the arguments in our proof yield a bound for the -term that decays extremely slowly as goes to infinity. In order to simplify the presentation of the proof and to avoid additional technical details, we decided to write the proof in a way that does not give any explicit bounds for the -term.
The general idea for the proof of Theorem 1.2 is to break up into different components that fluctuate at âdifferent scalesâ and handle each component separately. In the case where is a clique, this plan is relatively simple to execute, but in the more general setting of Theorem 1.2 there are a number of additional challenges that must be overcome. We discuss these in Section 2. Our proof has a number of new ingredients; one that is perhaps worth highlighting is a combinatorial anticoncentration inequality for vector-valued random variables that behave âalmost linearlyâ, in the spirit of some anticoncentration theorems due to HalĂĄsz. The details are in Section 3.
1.1 Basic definitions and notation
We use standard graph-theoretic notation throughout. In particular, the vertex and edge sets of a graph are denoted by and , and the sizes of these sets are denoted by and . For disjoint vertex sets in a graph , we write for the number of edges inside , and we write for the number of edges between and . Abusing notation, for a vertex we write to mean , which is the size of the -neighbourhood of in . We write for the neighbourhood of (that is, the set of vertices adjacent to ). A homomorphism from a multigraph to a multigraph consists of a map from the vertices of to the vertices of , and a map from the edges of to the edges of , such that whenever is an edge of between vertices and , the image is an edge between the vertices and .
We initially introduced as the random variable that counts unlabelled copies of (as is standard in this area), but for the proof it will be slightly more convenient to redefine to count the number of labelled copies of (injective homomorphisms from into ). The labelled/unlabelled distinction is irrelevant for Theorem 1.2, because these two counts differ by a fixed multiplicative factor (the number of automorphisms of ).
We use standard asymptotic notation throughout. For functions f=f\mathopen{}\mathclose{{}\left(n}\right) and g=g\mathopen{}\mathclose{{}\left(n}\right) we write f=O\mathopen{}\mathclose{{}\left(g}\right) to mean that there is a constant such that \mathopen{}\mathclose{{}\left|f}\right|\leq C\mathopen{}\mathclose{{}\left|g}\right|, we write f=\Omega\mathopen{}\mathclose{{}\left(g}\right) to mean there is a constant such that f\geq c\mathopen{}\mathclose{{}\left|g}\right| for sufficiently large , we write f=\Theta\mathopen{}\mathclose{{}\left(g}\right) to mean that f=O\mathopen{}\mathclose{{}\left(g}\right) and f=\Omega\mathopen{}\mathclose{{}\left(g}\right), and we write f=o\mathopen{}\mathclose{{}\left(g}\right) or g=\omega\mathopen{}\mathclose{{}\left(f}\right) to mean that as . Unless stated otherwise, all asymptotics are as (all other variables should be viewed as constant).
We will use notation of the form to indicate an expected value with respect to a random choice of (if there are other sources of randomness, then formally this is a conditional expected value). We write for the -Bernoulli distribution, meaning that if then and . Finally, we write for the set of non-negative integers, we write for the set , and all logarithms are to base .
2 Discussion and main ideas of the proof
Before discussing the new ideas in the proof of Theorem 1.2, it is worth reviewing the proofs in [19] giving weaker anticoncentration bounds. We will build on these ideas to prove Theorem 1.2.
First, to prove the bound \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|X_{H}-x}\right|\leq n^{v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right)=O\mathopen{}\mathclose{{}\left(1/n}\right), the key observation was that is an âalmost linearâ function of the edges of the underlying random graph G\sim\mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right). Specifically, for a pair of vertices e=\mathopen{}\mathclose{{}\left\{x,y}\right\}, the difference \Delta X_{H}:=X_{H}\mathopen{}\mathclose{{}\left(G+e}\right)-X_{H}\mathopen{}\mathclose{{}\left(G-e}\right) is tightly concentrated around its expectation \mathbb{E}\Delta X_{H}=\Theta\mathopen{}\mathclose{{}\left(n^{v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right), meaning that adding or removing an edge typically causes to increase or decrease by about this amount444This observation is closely related to the fact that the number of edges in is closely correlated with . This fact can be used to prove a central limit theorem for (see for example [23, Example 6.4]).. Using this observation, and some ideas from ErdĆsâ proof of the ErdĆsâLittlewoodâOfford theorem [18] and Lubellâs proof of the LYM inequality [28], it is possible to prove that the anticoncentration behaviour of is about the same as the anticoncentration behaviour of the number of edges of , which is a binomial random variable with parameters and . This gives the desired bound, which in some sense gives âcoarse scaleâ anticoncentration for .
Second, to prove the bound \Pr\mathopen{}\mathclose{{}\left(X_{K_{h}}=x}\right)\leq n^{1-h+o\mathopen{}\mathclose{{}\left(1}\right)} for cliques, the key idea was to fix a vertex and write X_{K_{h}}\mathopen{}\mathclose{{}\left(G}\right)=X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)+X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right). That is, the number of -cliques in is the same as the number of -cliques in , plus the number of \mathopen{}\mathclose{{}\left(h-1}\right)-cliques in the neighbourhood of (which yield an -clique when combined with ). Now, it is possible to use similar ideas as in the preceding paragraph to show that X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right) is anticoncentrated at a coarse scale. On the other hand, X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) has a much smaller order of magnitude, and it is actually concentrated on a relatively small interval around its expectation. Furthermore, we can bound the point probabilities for X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) inductively.
Then, roughly speaking, the idea was as follows: We want to bound the probability of having X_{K_{h}}\mathopen{}\mathclose{{}\left(G}\right)=X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)+X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)=x. Since X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) is concentrated on a small interval around its expectation \mathbb{E}X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right), in order to have X_{K_{h}}\mathopen{}\mathclose{{}\left(G}\right)=x the value of X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right) must (typically) be reasonably close to x-\mathbb{E}X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right). Using the coarse scale anticoncentration bounds for X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right), we can bound the probability that this happens. After knowing the value X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right), we know what value X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) needs to take to have X_{K_{h}}\mathopen{}\mathclose{{}\left(G}\right)=x. By induction, we can bound the probability that X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) takes this particular value.
If X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right) and X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) were independent, it would be easy to conclude the desired anticoncentration bound \Pr\mathopen{}\mathclose{{}\left(X_{K_{h}}=x}\right)\leq n^{1-h+o\mathopen{}\mathclose{{}\left(1}\right)}; we would be able to simply multiply the above two probability estimates. Unfortunately, these random variables are not independent, so we need to rule out the possibility that the fluctuations in X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right) âcancel outâ the fluctuations in X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) in a way that causes to concentrate on a particular value. The approach we took was to show that actually X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) is anticoncentrated even after conditioning on a typical outcome of (after which the only remaining randomness comes from the set of neighbours N\mathopen{}\mathclose{{}\left(v}\right)).
We proved a conditional anticoncentration bound of this type by induction, using a moment argument. To be more specific, we viewed the conditional probabilities \Pr\mathopen{}\mathclose{{}\left(X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)=z\,\middle|\,G-v}\right) as random variables depending on . To study these random variables we studied their high moments, which essentially comes down to considering collections of different candidates for the neighborhood , and bounding the probability that for each of these candidates we simultaneously have X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right)=z. We accomplished this with a multiple exposure argument: we iteratively went through our candidate sets for and exposed the edges of inside each set which had not yet been exposed before. Using a suitable induction hypothesis, and the ideas sketched earlier, at each step we can bound the probability that the corresponding candidate set for gives rise to the desired value of X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right). This way we obtained a suitable bound for the moment argument.
Now, there are several obstacles that need to be overcome to generalise the above argument beyond the case where is a clique. First, the decomposition X_{K_{h}}=X_{K_{h}}\mathopen{}\mathclose{{}\left(G-v}\right)+X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) was very convenient for us: it allowed us to consider two separate random variables, one of which can be studied on a âcoarseâ scale using our LittlewoodâOfford type techniques, and the other of which can be studied inductively. Actually, it is not a huge problem to generalise this decomposition. In general, we have X_{H}=X_{H}\mathopen{}\mathclose{{}\left(G-v}\right)+X_{H}^{v}, where is the number of copies of in which contain . One can check that is a certain sum of weighted subgraph counts in , where each of the subgraphs has v\mathopen{}\mathclose{{}\left(H}\right)-1 vertices, and the weight of a subgraph depends on its intersection with the neighbourhood N\mathopen{}\mathclose{{}\left(v}\right) of . So, if we generalise the induction hypothesis to certain weighted sums of subgraph counts, it is still possible to control the anticoncentration of inductively.
The second main obstacle, which is more serious, concerns the multiple-exposure argument we used to show that X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) is anticoncentrated even after conditioning on a typical outcome of . This crucially depended on the fact that in order to know the value of X_{K_{h-1}}\mathopen{}\mathclose{{}\left(G\mathopen{}\mathclose{{}\left[N\mathopen{}\mathclose{{}\left(v}\right)}\right]}\right) given a particular candidate for N\mathopen{}\mathclose{{}\left(v}\right), the only edges of that we need to expose are the edges inside N\mathopen{}\mathclose{{}\left(v}\right) (leaving the remaining edges for future rounds of exposure). Unfortunately, in general one may need to examine all the edges of to determine the value of , even if we fix a candidate for N\mathopen{}\mathclose{{}\left(v}\right). Specifically, this is the case whenever is not a complete multipartite graph (if is complete multipartite, then we do not need to expose the edges which lie completely outside N\mathopen{}\mathclose{{}\left(v}\right), and actually in this special case it is not too hard to extend the proof in [19] to prove Conjecture 1.1).
In the general case where is not a complete multipartite graph, in order to use a moment argument as above, we need some other way to estimate the joint probability that many candidates for N\mathopen{}\mathclose{{}\left(v}\right) each result in a specific outcome of . Write X_{H}^{v}\mathopen{}\mathclose{{}\left(N\mathopen{}\mathclose{{}\left(v}\right),G-v}\right) to indicate the dependence of on both N\mathopen{}\mathclose{{}\left(v}\right) and . For a collection of sets as candidates for N\mathopen{}\mathclose{{}\left(v}\right), we want to control the joint probability that all of X_{H}^{v}\mathopen{}\mathclose{{}\left(A_{1},G-v}\right),\dots,X_{H}^{v}\mathopen{}\mathclose{{}\left(A_{t},G-v}\right) are equal to a given value. Since we cannot consider these random variables separately anymore (as we did before with the multiple exposure argument), we need to somehow modify the induction hypothesis to handle this joint probability.
Specifically, we can generalise to a statement about joint anticoncentration probabilities of random variables of the following type (thus strengthening the induction hypothesis). Take a sequence of distinct vertices and for each , consider some collection of different candidates for the neighbourhood of . Then, consider the different random variables obtained by making different choices for the neighbourhoods for each of , and considering the number of copies of which contain all of , conditioned on having these neighbourhoods. The idea is that at each step of the induction we introduce a new vertex , and we consider many candidates for the neighbourhood of for a moment argument. Given that we are considering joint probabilities of random variables, our induction hypothesis needs to give a bound of the form on the joint probability that all our random variables are equal to particular values. This can be viewed as an anticoncentration bound for a random vector .
To make the above ideas work, we need multivariate generalisations of some of the ideas described so far. For example, we need an anticoncentration inequality for random vectors that are almost-linear in the sense sketched earlier, having the property that adding or removing an edge typically causes a predictable change in their values. There are some classical anticoncentration inequalities by HalĂĄsz [21] that give the kind of bounds we need, for random vectors that depend linearly (and ânon-degeneratelyâ) on a sequence of independent random choices. (Some kind of non-degeneracy assumption is necessary, to rule out situations where the random vector is always contained in a proper subspace of smaller dimension). The standard proofs of HalĂĄszâ inequalities are Fourier-analytic, and are not robust enough to apply to our almost-linear setting, but we were able to find some combinatorial arguments that apply to our setting, again inspired by proofs of ErdĆs [18] and Lubell [28]. More details are in Section 3.
In order to use the estimates in Section 3, we need to check a non-degeneracy condition: basically, we need to consider the effects of changing the status of various edges, and we need to show that the corresponding changes to are in âmany different directionsâ, spanning . We also need to check a similar non-degeneracy condition for the effects of adding or removing vertices from the various candidate neighbourhoods. Unfortunately, these non-degeneracy conditions do not hold for an arbitrary connected graph (they do hold, however, if has a vertex with edges to all other vertices). Therefore, we actually need to further modify our approach.
Instead of considering a single vertex in each step of the induction, we will consider different vertices, having a diverse range of adjacencies to the vertices previously chosen. Then, our decomposition is that we split the copies of into the copies that contain none of our identified vertices, and the copies that contain at least one of them. With this modification, there is a much richer range of possibilities for the effect of changing the status of an edge, and this allows us to prove the desired non-degeneracy condition. Unfortunately, while this modification is conceptually rather simple, it complicates notation enormously. We now need to maintain a collection of sets of vertices, and a collection of possibilities for the neighbourhoods of these vertices. To encode all of these data we introduce the notion of a colour system: each step of the induction is associated with a different colour, and at each step there are multiple âshadesâ of each colour indicating the different possibilities for the neighbourhoods of the various vertices introduced at that step. We can then state our induction hypothesis for a class of random variables defined in terms of colour systems, and prove it using the ideas we discussed in this outline.
3 Anticoncentration for âalmost-linearâ random vectors
The ErdĆsâLittlewoodâOfford theorem states that if are independent Bernoulli random variables satisfying , and is a linear combination of these random variables (where each coefficient has absolute value at least one) then for any we have . As outlined in Section 2, in [19, Theorem 1.2] we adapted ErdĆsâ proof of this theorem to handle the case of âalmost-linearâ functions of .
In [21], among other results, HalĂĄsz gave a multivariate generalisation of the ErdĆsâLittlewoodâOfford theorem, for sums of random vectors satisfying a certain non-degeneracy condition. Specifically, suppose that are -dimensional vectors with the property that for every unit vector , there are \Omega\mathopen{}\mathclose{{}\left(n}\right) indices with \mathopen{}\mathclose{{}\left|\mathopen{}\mathclose{{}\left\langle\boldsymbol{a}_{i},\boldsymbol{e}}\right\rangle}\right|\geq 1. HalĂĄsz proved that, with , we have \max_{\boldsymbol{x}\in\mathbb{R}^{d}}\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{X}-\boldsymbol{x}}\right\|_{2}\leq 1}\right)=O\mathopen{}\mathclose{{}\left(n^{-d/2}}\right). As outlined in Section 2, for the proof of Theorem 1.2 we will need a similar bound for almost-linear . Our non-degeneracy condition will be somewhat cruder than HalĂĄszâ; we assume that there are vectors , spanning , such that each of these vectors is represented \Omega\mathopen{}\mathclose{{}\left(n}\right) times as the direction of the âtypical effectâ of changing the status of some .
Theorem 3.1**.**
Fix real numbers and , integers , and vectors spanning . Then there is a constant , depending on , , and , such that the following holds. For any positive integer and any function \boldsymbol{f}:\mathopen{}\mathclose{{}\left\{0,1}\right\}^{n}\to\mathbb{R}^{d}, let \boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n} and for define the random variables
[TABLE]
Suppose that for some positive real numbers and with there are disjoint subsets I_{1},\dots,I_{m}\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,n}\right\} of size at least such that for each we have \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-s\boldsymbol{v}_{j}}\right\|_{\infty}\geq r}\right)\leq n^{-6d^{2}}. Then for any we have
[TABLE]
Roughly speaking, the assumption on the function in Theorem 3.1 means the following: For each of the vectors (with ) there is a reasonably large subset , such that for every the following holds. When changing the -th coordinate of from [math] to the corresponding vectors typically differ by roughly (more precisely, with high probability the difference of the corresponding vectors is close to the vector ). This condition can be seen as some sort of âalmost-linearityâ (at least with respect to certain coordinates of ). Intuitively, this condition suggests that for a random vector \boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n}, the vector must be reasonably spread out and not too concentrated close to any given . Theorem 3.1 makes this intuition precise. The precise bound of for the probability \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-s\boldsymbol{v}_{j}}\right\|_{\infty}\geq r}\right) in the assumptions of Theorem 3.1 was chosen for convenience in the proof of the theorem. The exponent is certainly not sharp, but the precise value of this exponent is not relevant for the remainder of this paper, so we made no effort to optimise the exponent at the cost of complicating the proof of Theorem 3.1.
As mentioned above, Theorem 3.1 can be considered to be an analogue of HalĂĄszâ classical anticoncentration inequality for linear vector-valued functions [21] (satisfying a certain non-degeneracy condition), in the weaker setting of âalmost-linearâ functions. However, our non-degeneracy condition is somewhat more restrictive than the one in HalĂĄszâ original result. We also remark that an inequality very similar to Halaszâ was also proved by Tao and Vu [39, Theorem 1.4], and that there is a large body of work proving similar results without a non-degeneracy condition (in which case the bounds are much weaker; see for example the survey in [31, Section 2]).
Before starting the proof of Theorem 3.1, we record the following basic fact about lattices.
Lemma 3.2**.**
Fix a basis of . There exists a constant , only depending on , such that for any and any real number there are at most different -tuples of integers with \mathopen{}\mathclose{{}\left\|(t_{1}\boldsymbol{v}_{1}+\dots+t_{d}\boldsymbol{v}_{d})-\boldsymbol{x^{\prime}}}\right\|_{\infty}<z.
We next prove Theorem 3.1, further developing the ideas in the proof of [19, Theorem 1.2]. As mentioned above, the assumption on in Theorem 3.1 means that when changing the -th coordinate of from zero to one for some , the vector typically changes by roughly . This means that, if we successively change appropriately chosen zeros to ones in , we can (with sufficiently high probability) control the changes of the vector , and show that cannot be too often close to any given vector . Indeed, if we change the coordinates of with indices in , then the vector will typically move roughly along a lattice spanned by the vectors , and we can use Lemma 3.2 to show that there are not too many choices for where \mathopen{}\mathclose{{}\left\|\boldsymbol{f}(\boldsymbol{\chi})-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}.
Proof of Theorem 3.1.
First, by relabelling the given vectors , we may assume that form a basis of . We ignore the other vectors (in other words, we may assume that ).
Now, for , let , so . Let \boldsymbol{\xi}^{j}=\mathopen{}\mathclose{{}\left(\xi_{i}}\right)_{i\in I_{j}} be the restriction of the random vector \boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n} to the coordinates in . Observe that the number of ones in is binomially distributed with parameters and . Therefore, for any we have that
[TABLE]
for a constant only depending on .
Now, observe that \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t}\right) is an increasing function in for and a decreasing function for . Thus, for each , there are integers such that for any integer we have
[TABLE]
That is to say, and are defined as the boundaries of the range of values that have probability at least of occurring as \mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|. We next bound the difference .
The Chernoff bound (see for example [1, Theorem A.1.4]) yields \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|<pn_{j}-d\sqrt{n\log n}}\right)\leq n^{-2d^{2}}\leq n^{-2d} and \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|>pn_{j}+d\sqrt{n\log n}}\right)\leq n^{-2d^{2}}\leq n^{-2d}. Thus, we must have
[TABLE]
and in particular for each . By the choice of and we have
[TABLE]
for every .
Now, for each , let be a uniformly random bijection (independently chosen for each ). Also, independently sample \chi_{i}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right) for each . For any integers , let the vector be defined as follows. For , we already chose , the -th entry of . If , then set if and only if . In other words, among the entries for there are precisely ones and those are in positions \sigma_{j}\mathopen{}\mathclose{{}\left(1}\right),\dots,\sigma_{j}\mathopen{}\mathclose{{}\left(t_{j}}\right).
For any given , the random vector depends on the choices of the bijections for and the random entries \chi_{i}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right) for . Very importantly, the distribution of is the same as the distribution of the random vector \boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n} in the theorem statement conditioned on having \mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j} for . In particular, fixing any , we have
[TABLE]
Hence, using the independence of the random variables as well as Equation 3.1, we obtain
[TABLE]
for each . On the other hand, Equation 3.3 implies
[TABLE]
(Note that to have for , we must have ). Thus, we obtain
[TABLE]
where the sum is taken over all . In other words,
[TABLE]
where is the random variable counting the number of -tuples with \mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}. This random variable depends on the random choices of for and on \chi_{i}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right) for .
Note that we always have . Furthermore, recall that the distribution of is the same as the distribution of \boldsymbol{\xi}\sim\operatorname{Ber}\mathopen{}\mathclose{{}\left(p}\right)^{n} conditioned on having \mathopen{}\mathclose{{}\left|\boldsymbol{\xi}^{j}}\right|=t_{j} for . This implies that for any , any and any , we have
[TABLE]
where we used Equation 3.2 and the assumption in the theorem that \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\boldsymbol{\xi}}}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r}\right)\leq n^{-6d^{2}}. Thus, by a union bound, the probability that there exist , and with \mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\geq r, is at most .
Now, fix , only depending on , as in Lemma 3.2.
Claim 3.3**.**
If \mathopen{}\mathclose{{}\left\|\Delta_{i}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-s\boldsymbol{v}_{j^{*}}}\right\|_{\infty}\leq r for all , all and all , then we have
Proof.
Note that for any and any with , the vectors and only differ in that the first of these vectors has a one in position , while the second has a zero in that position. Hence
[TABLE]
As , under the assumptions of the claim this implies
[TABLE]
for all and all with . Adding this up for different values of and using the triangle inequality, this implies that
[TABLE]
for all , where for the second inequality we used that for each .
Recall that is the number of (integer) -tuples which satisfy \mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\chi}(t_{1},\dots,t_{d})}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{n\log n}. For each such -tuple we then have (by the triangle inequality)
[TABLE]
and therefore
[TABLE]
Thus by Lemma 3.2 applied with as well as (note that as by the assumptions of the theorem), we obtain that
[TABLE]
as desired. â
Just before Claim 3.3, we proved that its assumptions are satisfied with probability at least . Thus, we obtain
[TABLE]
using that . Plugging this into Equation 3.4, we can conclude
[TABLE]
where in the last inequality we used that . This implies the statement of Theorem 3.1 with . â
4 Colour systems and the induction hypothesis
As outlined in Section 2, our proof of Theorem 1.2 proceeds via induction over a class of random variables generalising subgraph counts. These random variables are defined via colour systems.
Definition 4.1**.**
For integers and , a colour system with parameters is a multigraph without loops which is coloured according to the following rules.
- âą
Each vertex has at most one colour and for each , there are exactly vertices of colour .
- âą
Each edge is incident to at least one coloured vertex.
- âą
Each edge has exactly one colour. If an edge is incident to exactly one coloured vertex, it receives the colour of that vertex. If an edge is incident to two coloured vertices, and these vertices have colours and , then the edge has colour .
- âą
Each edge of colour is additionally labelled with an integer in (we say that there are different shades) of colour . We do not assign shades to vertices, only edges.
- âą
Between any two vertices, there is at most one edge of each shade of each colour (but there can be multiple edges of different shades of the same colour).
The order of a colour system is its total number of vertices (both coloured and uncoloured).
For a colour system , let be the set of uncoloured vertices of . In most of the statements throughout the paper, we will consider colour systems whose order is large with respect to the parameters (in which case almost all vertices of are uncoloured). In all of our statements involving asymptotic notation, the colour system parameters will be treated as fixed constants for the asymptotic notation, while the order tends to infinity.
To make some sense of the definition of a colour system, recall from the outline in Section 2 that our proof is inductive, and at each step of the induction we consider multiple possibilities for the neighbourhoods of certain vertices. The colours in a colour system correspond to the vertices chosen at the different steps of the induction, and the different shades of colour correspond to different choices of neighbourhoods for the vertices of colour .
Now, we will mostly want to consider colour systems which have âtypicalâ structure, meaning that the sizes of intersections between neighbourhoods of vertices are about what one should expect if the neighbourhoods were chosen randomly. For this, we introduce a notion of âgeneral positionâ for families of sets.
Definition 4.2**.**
Consider subsets of some ground set . For any subset I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}, we write . For an integer and some , we say that are in -general position if for each I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}, we have
[TABLE]
Note that if , then and therefore in this case the empty collection of sets is in -general position for every integer .
Definition 4.3**.**
Say that a colour system is -general if the following holds. If we define to be the -neighbourhoods of each of the coloured vertices of in each of the shades of the respective colour (so we have if has parameters ), then the sets are in -general position.
Furthermore, say that the colour system is weakly -general if the sets are in -general position.
Note that every -general colour system is in particular weakly -general (the reason for having both these definitions is that when we make small changes to a collection of sets in -general position, the parameter changes slightly, and it is convenient to not have to explicitly keep track of this change). Also, note that for , every colour system with parameters is -general (since and the empty collection of sets is always in in -general position for all ).
Recall from Section 2 that the whole point of introducing multiple vertices at each step of the induction is to allow for a richer range of possibilities for the effect of changing the status of an edge. In order to ensure the richest possible range of possibilities, we consider colour systems which are complete in the sense that we see essentially all the possible adjacencies between the coloured vertices, as follows.
Definition 4.4**.**
Call a colour system with parameters complete, if for any the following holds. Suppose for each and each vertex in of colour we are given a subset . Then there exists a vertex of colour such that for every and every vertex of colour the vertices and are connected by edges of exactly those shades of colour that belong to the set .
Informally speaking, Definition 4.4 demands that for every colour we can find a vertex with prescribed edges to all the vertices of the previous (smaller) colours. For every colour system with parameters is complete, as the condition in Definition 4.4 is vacuous.
Note that whether a colour system with parameters is complete only depends on the edges between the coloured vertices, and it does not at all depend on the edges in colour . In contrast, whether is -general for given only depends on the edges between the coloured and the uncoloured vertices.
Now, to obtain a graph from a colour system, we choose a shade of each colour, and we choose a graph on the uncoloured vertices, as follows.
Definition 4.5**.**
Let be a colour system with parameters . Then, given a -tuple , and a graph on the vertex set , define a graph by taking all vertices of together with all edges of and all edges of shade of colour for all . Furthermore, for a graph let be the number of labelled copies of in the graph which use at least one vertex of each of the colours.
We will use notation such as to denote the function that maps to . Our goal for the rest of this paper will be to prove the following strengthening of Theorem 1.2.
Theorem 4.6**.**
Fix some , an -vertex graph , and integers and . Let . Then for any -general complete colour system of order with parameters and for any function the following holds. If is a random graph on the vertex set , then
[TABLE]
The -term in Theorem 4.6 goes to zero as tends to infinity, but it may depend on the choices for , , , and fixed in the beginning of the theorem statement (in other words, these values are are treated as constants in the asymptotics).
Note that for , in Theorem 4.6 we have (using the convention that the empty product is equal to 1), and the colour system has no coloured vertices (so it consists of uncoloured vertices and no edges). We already observed that such a colour system is always -general and complete. Furthermore, is simply the number of labelled copies of in . This quantity is precisely the random variable in Theorem 1.2. Thus, Theorem 4.6 for states that for all we have with probability at most . This is precisely the statement of Theorem 1.2.
So, Theorem 1.2 corresponds to the case in Theorem 4.6, and it therefore suffices to prove Theorem 4.6. We will use backwards induction starting from and going down to . Note that the case is trival.
For the rest of the paper, we fix a particular graph with vertices and some . Before concluding this section, we make a few more definitions and state an important intermediate result for the induction step (Proposition 4.10, to follow). Basically, at each step of the induction, we have a colour system with colours, and we add vertices of a new colour with random neighbourhoods, obtaining a random colour system . We will use the ââ case of Theorem 4.6 and a moment argument to show that typically has the property that is anticoncentrated, subject only to the randomness in . This will be the content of Proposition 4.10.
Definition 4.7**.**
For integers and , define a restricted colour system with parameters to be a colour system with parameters in which there are no edges in colour between the coloured and the uncoloured vertices (so all edges of colour are between the vertices of colour , recalling that the colour of an edge is the minimum of the colours of its endpoints).
Call a restricted colour system with parameters complete, if it is complete when viewed as a colour system with parameters . Call a restricted colour system with parameters essentially -general, if the colour system with parameters obtained by ignoring colour is -general. Similarly, call essentially weakly -general, if the colour system obtained by ignoring colour is weakly -general.
Definition 4.8**.**
Consider a restricted colour system with parameters . For each vertex of colour in , choose a random subset by taking each element of independently with probability (and choose the different sets all independent from each other). Let be the collection of random sets chosen this way. Then we can obtain a random colour system with parameters by connecting each vertex of colour to all vertices in and colouring all these new edges in the unique shade of colour .
Definition 4.9**.**
Consider and an essentially -general complete restricted colour system with parameters . We say that a graph on the vertex set is -dispersed if for all functions the following holds. When choosing randomly as in Definition 4.8, we have \Pr\mathopen{}\mathclose{{}\left(\psi_{H}(\mathcal{G_{S}},G_{0},\cdot)=\lambda}\right)\leq q.
Now we are ready to state Proposition 4.10, as announced above.
Proposition 4.10**.**
Fix integers and and let . Then there exists a function with , such that for any essentially -general complete restricted colour system of order with parameters the following holds. If is a random graph on the vertex set , then with probability the graph is -dispersed, where .
We remark that in the case , we have in Proposition 4.10 (again by the convention that the empty product is equal to ).
The rest of the paper is organised as follows. In Section 5 we give some very straightforward lemmas about sets in general position (in particular, random collections of sets are very likely to be in general position). In Section 6 we use a moment argument to prove that the ââ case of Theorem 4.6 implies the corresponding case of Proposition 4.10. In Section 7 we show how this case of Proposition 4.10 implies the ââ case of Theorem 4.6, completing the induction step. The contents of both of these sections consist mostly of calculations and putting various pieces together. However, in both these sections we omit the proofs of important anticoncentration lemmas (Lemmas 6.3 and 7.3). The rest of the paper is spent proving these lemmas via our new multivariate anticoncentration inequality in Theorem 3.1. In Section 8 we introduce the formalism of a âcoreâ, which will be used for the proofs of Lemmas 6.3 and 7.3. To be specific, we prove a linear independence lemma for certain vectors defined in terms of cores, which we will use to check the linear independence condition in Theorem 3.1. Finally, in Section 9 we put everything together to prove Lemmas 6.3 and 7.3.
5 Sets in general position
In this section we record some simple lemmas regarding sets in general position. Recall that in the last section we fixed some .
Lemma 5.1**.**
Suppose that are subsets of some ground set which are in -general position, for some integer . Then for every subset I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}, we have
[TABLE]
Proof.
Fix some subset I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}. For every subset J\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\} with , let be as in Definition 4.2. Note that there are at most such subsets . Also, note that the set is the union of all the set for all J\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\} with , and all these sets are disjoint. Hence
[TABLE]
Here, and in the rest of this proof, the sum is over all subsets J\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\} with . Noting that
[TABLE]
we obtain that
[TABLE]
as desired. â
Lemma 5.2**.**
Fix some . Let be an -element set and let be in -general position for some integer . Let be a random set chosen by taking each element of independently with probability . Then with probability the sets are in -general position.
Proof.
For any subset I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}, let be as in Definition 4.2. We need to show that with probability we have
[TABLE]
and
[TABLE]
for each I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}. By a union bound over all choices for , it suffices that for each individual set I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\} each of these properties holds with probability .
Fix some I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}. By assumption, the set satisfies
[TABLE]
Note that \mathopen{}\mathclose{{}\left|S_{I}\cap S_{m+1}}\right| is binomially distributed with parameters (\mathopen{}\mathclose{{}\left|S_{I}}\right|,p). Thus, the probability to have
[TABLE]
is by the Chernoff bound at least
[TABLE]
where we used that as . Whenever Equation 5.4 is satisfied, then together with Equation 5.3 we obtain by the triangle inequality
[TABLE]
as desired. Thus Equation 5.1 indeed holds with probability . Similarly, we can show that Equation 5.2 holds with probability by considering the probability that
[TABLE]
holds, using that \mathopen{}\mathclose{{}\left|S_{I}\cap(R\!\setminus\!S_{m+1})}\right| is binomially distributed with parameters (\mathopen{}\mathclose{{}\left|S_{I}}\right|,1-p). â
Lemma 5.3**.**
Fix some positive integer . Then the following holds for sufficiently large . Let be an -element set and let be in -general position for some integer . Let be an integer with . Then for any subset obtained by deleting elements from , the sets are in -general position.
Proof.
We need to show that for every subset I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\} we have
[TABLE]
So fix some I\subseteq\mathopen{}\mathclose{{}\left\{1,\dots,m}\right\}. By assumption, the set satisfies
[TABLE]
Now, is obtained from by deleting at most elements, and therefore
[TABLE]
Furthermore, we have
[TABLE]
Thus, the triangle inequality yields (using )
[TABLE]
as long as is sufficiently large with respect to . â
Corollary 5.4**.**
Fix , and suppose is sufficiently large with respect to these values. Let be a -general colour system of order with parameters . Then, whenever we delete vertices in , the resulting colour system is weakly -general.
Proof.
Let be the -neighbourhoods of each of the coloured vertices of in each shade of their respective colours. Then are in -general position. Thus, by Lemma 5.3, whenever we delete vertices in , the resulting configuration of sets is in -general position. â
Lemma 5.5**.**
Fix . If is a an essentially weakly -general restricted colour system of order with parameters , then the colour system in Definition 4.8 is -general with probability .
Proof.
Let be the colour system of order with parameters obtained from the restricted colour system by ignoring colour (so consists of the set , together with the vertices of colour in ). By the assumption on , the colour system is weakly -general. Let be the -neighbourhoods of each of the coloured vertices of in each shade of their respective colours (so we have ). Then the sets are in -general position. Note that and , and that , so by Lemma 5.3 (assuming is large), the sets are in -general position. Applying Lemma 5.2 times, we see that with probability at least these sets together with the random sets in are in -general position. This proves Lemma 5.5. â
6 Random neighbourhoods: Theorem 4.6 implies Proposition 4.10
In this section we will prove that if Theorem 4.6 holds for some , then Proposition 4.10 also holds for this value of .
Fix some and assume that Theorem 4.6 holds for this value of . Let be arbitrary positive integers. Our goal in this section is to prove Proposition 4.10 for these values of , .
For the entirety of this section, we fix these values of , and define . In all our asymptotics, these values will be treated as constants, while .
In Section 6.1 we will start with some preparations. First, we use a martingale concentration inequality to prove that is very likely to be close to its conditional expectation given (by symmetry, this conditional expectation actually only depends on the sizes of the intersections between the sets in and the neighbourhoods of the various coloured vertices). Second, we state (but do not yet prove) an anticoncentration lemma for , subject to the randomness in .
In Section 6.2 we will use our preparatory lemmas and Theorem 4.6 to prove an anticoncentration bound for certain joint probabilities concerning the values of for different choices of . This will be the key input for a moment argument (as outlined in Section 2) with which we will deduce that is very likely to be dispersed, proving the desired case of Proposition 4.10. The details of this deduction will be presented in Section 6.3.
6.1 Preparations
Definition 6.1**.**
For a restricted colour system with parameters and an outcome of the random collection of sets in Definition 4.8, let be the function given by
[TABLE]
for all . Here, is a random graph on the vertex set .
We remark that only depends on and , and not .
Lemma 6.2**.**
For any essentially -general complete restricted colour system which has parameters , the following holds. If we choose a random graph and independently choose randomly as in Definition 4.8, then
[TABLE]
Proof.
Condition on an arbitrary outcome of . By a union bound, it suffices to prove that for each the probability that
[TABLE]
is . So fix some . The expectation of is precisely (recall that we are conditioning on an outcome of ).
Note that changing the status of an edge of changes by at most . This is because are at most different labelled copies of in the -vertex graph which contain any particular pair of vertices of , and contain at least one vertex of each of the colours. Thus, by the AzumaâHoeffding inequality (see for example [1, Theorem 7.2.1]) with the edge-exposure martingale, the probability that Equation 6.1 occurs is at most
[TABLE]
This finishes the proof of Lemma 6.2. â
Recall that we fixed and throughout this section, and that we defined .
Lemma 6.3**.**
For any essentially -general complete restricted colour system of order with parameters and any the following holds. If we choose randomly as in Definition 4.8 then we have
[TABLE]
We defer the proof of Lemma 6.3 to Section 9.
6.2 Joint anticoncentration
The following statement is the key lemma for the moment argument we will use to finish the proof of Proposition 4.10.
Lemma 6.4**.**
Fix . Then for any essentially -general complete restricted colour system of order with parameters , and any function , the following holds. If we choose, all independently, a random graph on the vertex set as well as random collections chosen as in Definition 4.8, then
[TABLE]
Proof.
Fix an essentially -general complete restricted colour system of order with parameters , and a function .
For an outcome of the random collections , we obtain a colour system with parameters as follows. Recall that by Definition 4.7, has only one shade of colour and all edges of colour are between the vertices of colour . Replace each of these edges by parallel edges with all shades of colour . Also, for each and each vertex with colour , add edges, in shade of colour , between and all vertices in the set (where is the set corresponding to the vertex in the collection ). We emphasise that only depends on the random choices of , and not on the random choice of .
Note that and that for any , deleting all edges of colour except those of shade yields precisely the colour system with parameters . Thus, for any outcome of , we have
[TABLE]
for all and all . Also note that is always complete, because is complete and being complete does not depend on the edges of colour .
Say that an outcome of is common if both of the following conditions hold.
- (i)
we have
[TABLE]
for all ;
- (ii)
the colour system given by and is -general.
Claim 6.5**.**
* is common with probability .*
Proof.
By Lemma 6.2, for each the probability that (i) fails is at most . Thus, (i) holds with probability at least .
For (ii), note that by Lemma 5.5 the colour system is -general with probability . We can then apply Lemma 5.2 times for all the additional sets in . â
Now, in order to prove Lemma 6.4, we need to bound the probability that
[TABLE]
We remark that if we condition on an outcome for satisfying (ii), then one can apply Theorem 4.6 to get an upper bound on this probability (using only the randomness of ). However, this bound is weaker than the one claimed in Lemma 6.4. We will be able to obtain a stronger bound by using the randomness of both and of . For the argument, both of the conditions (i) and (ii) will be relevant.
Whenever the outcome of is common (specifically, when (i) is satisfied), in order to satisfy Equation 6.3 we must have
[TABLE]
Note that Equation 6.4 does not depend on , only on the random sets in . By Lemma 6.3, for each the probability of having \mathopen{}\mathclose{{}\left\|\mu_{\mathcal{G},\mathcal{S}_{i}}-\lambda}\right\|_{\infty}\leq n^{h-g-1}\cdot\log n is at most . So, by the independence of the , the probability that Equation 6.4 holds is bounded as follows.
[TABLE]
Now, fix any outcomes of satisfying (ii) and Equation 6.4. By condition (ii), the colour system is -general, and furthermore recall that is complete. Hence, applying Theorem 4.6 to the colour system (which has parameters ), conditioned on our outcomes of , we have
[TABLE]
(Recall that in this section we are assuming that Theorem 4.6 holds for ). In other words, recalling Equation 6.2, if we condition on any outcomes of such that (ii) and Equation 6.4 hold, then the random choice satisfies Equation 6.3 with probability at most .
Combining this with Equation 6.5, we see that the probability that simultaneously satisfies Equation 6.4, (ii) and Equation 6.3 is at most
[TABLE]
Recall that any common outcome satisfying condition Equation 6.3 also needs to satisfy Equation 6.4, and recall from Claim 6.5 that is common with probability at least . Thus, the total probability that Equation 6.3 holds is at most . This finishes the proof of Lemma 6.4. â
In the proof of Proposition 4.10, we will use the following corollary of Lemma 6.4. In contrast to the setting of Lemma 6.4, in 6.6 we will not require to be fixed, but just that is sufficiently large with respect to (and with respect to the values for and that we fixed for this entire section). Also note that the statement of 6.6 does not contain any asymptotic notation.
Corollary 6.6**.**
As before, let and be fixed. Then for every , there exists such that the following holds for any essentially -general complete restricted colour system of order with parameters , and any function . If we choose, all independently, a random graph on the vertex set as well as random collections chosen as in Definition 4.8, then
[TABLE]
Proof.
For each , the -term in the statement of Lemma 6.4 with this particular value of converges to zero (as goes to infinity). Hence there is some such that this -term is at most for all . In other words, this means that whenever , the probability appearing in the statements of Lemma 6.4 and 6.6 is indeed at most , as desired. â
6.3 Completing the proof of Proposition 4.10
In this subsection, we will finally deduce Proposition 4.10 for our given value of . In order to do so, we will apply 6.6 with a carefully chosen value of . Recall that is not required to be fixed in 6.6, it is only required that is sufficiently large with respect to (and the values of and that we fixed throughout this section). Let be a function mapping each to some such that 6.6 holds.
In order to prove Proposition 4.10, we can assume that . For any integer , let us define to be the maximum with . Note that then we have for all , and that .
Define the function by for all , and observe that . Let be any essentially -general complete restricted colour system of order with parameters . We need to show that a randomly chosen graph on the vertex set is -dispersed with probability , where
[TABLE]
Recall from Definition 4.9 that a graph on the vertex set being -dispersed means that for all functions the following condition holds: when choosing randomly as in Definition 4.8, \Pr\mathopen{}\mathclose{{}\left(\psi_{H}(\mathcal{G_{S}},G_{0},\cdot)=\lambda}\right)\leq q. Let be the event that fails to satisfy this condition (i.e. that the probability is strictly larger than ) for some particular . So, we need to show that with probability , no occurs. Note that we always have , so we only need to consider possibilities for . It therefore suffices to show that for each : we then can take a union bound over all possibilities for . So, fix a function .
Let . All independently, choose a random graph , and choose collections as in Definition 4.8. By 6.6 (using that ), we have
[TABLE]
We remark that the probability on the left-hand side of Equation 6.6 can be interpreted as the -th moment of the random variable (depending on ) measuring the conditional probability that , for a random choice of as in Definition 4.8. The rest of the proof, to follow, can basically be interpreted as applying Markovâs inequality with this -th moment.
By the definition of , if we condition on an outcome of for which holds, then for each , with probability at least we have . Since the are independent, we deduce that
[TABLE]
Together with Equation 6.6, this implies
[TABLE]
and therefore, recalling ,
[TABLE]
(Recall that and that ). This concludes the proof.
7 Completing the induction step: Proposition 4.10 for implies Theorem 4.6 for
For the duration of this section, we fix some and assume that Proposition 4.10 holds for this value of . Our goal for this section is to prove Theorem 4.6 for . Our approach is as outlined in Section 2: we decompose into two parts, use a âcoarse-scaleâ anticoncentration lemma to handle the larger part, and use our assumed case of Proposition 4.10 to handle the smaller part.
Let be arbitrary positive integers. Our goal is to prove Theorem 4.6 for and these values of .
For this entire section, let us fix the values of , and define and . In all our asymptotics, these values will be treated as constants, while . In Section 7.1 we will prove a concentration lemma which we will use to control the fluctuation of the smaller of our two parts. We also state (but do not yet prove) an anticoncentration lemma which we will use for the larger of our two parts. In Section 7.2 we combine these lemmas with our assumed case of Proposition 4.10 to prove our desired case of Theorem 4.6.
7.1 Preparations
Definition 7.1**.**
Consider an essentially -general complete restricted colour system with parameters . Let be the function given by
[TABLE]
for all . Here, is a random graph on the vertex set and is a randomly chosen collection of sets as in Definition 4.8 (and and are chosen independently).
We remark that only depends on .
Lemma 7.2**.**
The following holds for any essentially -general complete restricted colour system with parameters . If we choose a random graph and independently choose randomly as in Definition 4.8, then
[TABLE]
Proof.
By a union bound, it suffices to prove that for each -tuple the probability to have
[TABLE]
is . So fix some -tuple . Note that the expectation of is precisely .
Note that the random collection as in Definition 4.8 consists of different sets , one for each vertex of colour . For any vertex , changing which of the sets the vertex belongs to, changes the edges in the graph between and the vertices of colour . Hence it changes the value of by at most (since there are at most that many labelled copies of in the -vertex graph which contain as well as at least one vertex of each of the colours).
Consider the Doob martingale with respect to obtained by first exposing the graph one edge at a time, and then exposing the random collection in the following way. In each step, for one vertex , we expose which of the sets of the collection contain the vertex . As in the proof of Lemma 6.2, changing the status of an edge of changes by at most . So, by the AzumaâHoeffding inequality the probability that Equation 7.1 occurs is at most
[TABLE]
This finishes the proof of Lemma 7.2. â
Lemma 7.3**.**
The following holds for any weakly -general complete colour system of order with parameters and any . If we choose a random graph on the vertex set then
[TABLE]
We defer the proof of Lemma 7.3 to Section 9.
7.2 Proof of Theorem 4.6 for
In this subsection, we deduce Theorem 4.6 for from Proposition 4.10 for . Consider any -general complete colour system of order with parameters and any function . We may assume that is sufficiently large with respect to the parameters that were fixed throughout this section.
Since is a -general colour system, the -neighbourhoods of each of the coloured vertices of in each of the shades of the respective colour are in -general position. This implies that there exist vertices in representing all possible ways to be adjacent to the coloured vertices. To be specific, for each and each vertex in of colour , consider any subset . There is a vertex such that for every and every vertex of colour , between and there are edges of exactly those shades of colour that belong to the set . For each choice of the subsets , fix a particular such vertex , and let be the set of all these fixed vertices . Then, has size
[TABLE]
To prove Theorem 4.6, we need to show that for a random graph , we have
[TABLE]
Fix any possible outcome of the edges of the graph between the vertices in . We will prove that the event occurs with probability at most conditioned on this outcome.
We can obtain a colour system of order with parameters from the colour system by deleting the vertices in . By 5.4, since is -general, is weakly -general. Also, is complete, because is complete and being complete does not depend on any of the uncoloured vertices.
Furthermore, from and our fixed outcome of , we can obtain a restricted colour system of order with parameters in the following way. First, colour all the vertices in with colour . Then, take a single shade of colour and colour all edges in with this single shade of colour . The restricted colour system so obtained is complete and essentially -general, by our choice of and the assumptions that is complete and -general.
Given our fixed outcome of , we can choose the rest of the random graph in two steps as follows. First, we choose a random graph on the vertex set . Then, choose random subsets for each vertex by taking each element of independently with probability (and choose all these sets independently of each other and independent of ). Now, take to be the graph on the vertex set obtained by starting with the union of the edges of the graphs and , and adding edges between each vertex and all the vertices in .
Let be the collection of random sets chosen above. Note that the random choice of is precisely what is described in Definition 4.8 for the restricted colour system (recall that is the set of vertices of colour in the restricted colour system ).
Then, for any outcome of and , and any , the graph is the same as the graph . (Here is the colour system obtained from the restricted colour system as in Definition 4.8.) Recall that by definition is the number of labelled copies of in the graph which use at least one vertex of each of the colours . If such a copy of contains at least one vertex in , then it is one of the labelled copies of in the graph which use at least one vertex of each of the colours of . On the other hand, if it does not contain a vertex of , then it is one of the labelled copies of in the graph which use at least one vertex of each of the colours . Thus, for any outcome of and , we have
[TABLE]
For random and as above, it therefore suffices to prove that
[TABLE]
Let be the function obtained by applying Proposition 4.10 with parameters , (recall that we are assuming that Proposition 4.10 holds for ). So . Call an outcome of typical if the following two conditions hold.
- (a)
For , the graph is -dispersed;
- (b)
we have \mathopen{}\mathclose{{}\left\|\psi_{H}(\mathcal{G^{\prime}_{S}},G_{0}^{-},\cdot)-\mu_{\mathcal{G^{\prime}}}}\right\|_{\infty}\leq n^{h-g-(1/2)}\cdot\log n.
Claim 7.4**.**
* is typical with probability .*
Proof.
Recall that is an essentially -general complete restricted colour system of order with parameters . Its set of uncoloured vertices is . Thus, by Proposition 4.10 (which we assumed to hold for ), the random graph is -dispersed with probability . This shows that (a) holds with probability .
On the other hand, (b) holds with probability by Lemma 7.2 applied to the restricted colour system . â
Note that whenever an outcome of is typical (specifically, whenever (b) holds), we cannot have unless
[TABLE]
(here we used that is sufficiently large with respect to ).
Consider the function defined by . By Lemma 7.3 applied with the function and the weakly -general complete colour system of order with parameters , we see that Equation 7.3 holds with probability at most .
Note that both Equation 7.3 and (a) only depend on the random choice of and not on the random choice of . For any outcome of such that Equation 7.3 and (a) hold, the conditional probability of the event that is at most . (This follows directly from the definition of being -dispersed; see Definition 4.9).
Thus, the total probability that is typical and satisfies is at most
[TABLE]
recalling that . By Claim 7.4, the probability that is not typical is , so the probability in Equation 7.2 is at most . This finishes the proof of Theorem 4.6 for .
8 Cores and non-degeneracy
It remains to prove Lemmas 6.3 and 7.3. Both of these lemmas will be proved using our new multivariate anticoncentration inequality (Theorem 3.1), which requires us to check a non-degeneracy condition for a certain collection of vectors. In this section we introduce the formalism of cores, which are special types of colour systems of bounded size. For a core , we will define certain functions (which may be interpreted as belonging to a vector space of functions), and we show that under certain conditions these functions span the entire space.
The point of this section is that in the settings of both Lemma 6.3 and Lemma 7.3, the collections of vectors that we need to study to apply Theorem 3.1 are in correspondence with collections of functions , for appropriately chosen cores . In the next section (Section 9) we will specify how to actually choose the cores for these correspondences. Throughout this section, fix any (recall that is the number of vertices of our fixed graph ).
Definition 8.1**.**
For integers and , a core with parameters is a colour system with parameters which has exactly one uncoloured vertex, such that the uncoloured vertex has edges to all coloured vertices in all possible shades (meaning that for each , the uncoloured vertex has edges in all shades of colour to all vertices of colour ).
Note that for fixed parameters there are only finitely many different cores with these parameters.
A core with parameters is called complete if it is complete when viewed as a colour system with parameters .
Definition 8.2**.**
A partial copy of in a core with parameters is a labelled copy of a -vertex induced subgraph of which uses one vertex of each colour in and the uncoloured vertex. More formally, a partial copy of in a core is given by a graph homomorphism for some subset of size such that the image of contains one vertex of each colour and the uncoloured vertex.
Note that the homomorphism in Definition 8.2 is automatically injective on vertices (and therefore also on edges).
Definition 8.3**.**
Consider a core with parameters . For a subset of size , define the weight of a partial copy of to be . Also, for a -tuple , say that a partial copy of in is -coloured if for each all the edges of colour in the image of have shade .
Note that if a partial copy of in does not contain any edges of colour in its image, then it can be -coloured for different values of . However, if has at least one edge of colour in its image, then it can only be -coloured if is the shade of the edges of colour in the image of (if there are different edges of colour with different shades in the image of , then is not -coloured for any ).
Definition 8.4**.**
For a core with parameters and a collection of edges , let be the function defined as follows. For all , let be the sum of the weights of all -coloured partial copies of in whose image contains all edges in . If just consists of one edge , we write instead of .
Note that we have if for some the set contains an edge of colour with a shade distinct from , because then there are no -coloured partial copies of in whose image contains this edge.
Now, the main result of this section is as follows, showing that the functions satisfy a non-degeneracy condition.
Lemma 8.5**.**
Let be a complete core with parameters . Consider the functions , where ranges over all edges between the uncoloured vertex of and a vertex of colour . Then these functions span the real vector space of all functions .
8.1 Proof of Lemma 8.5
For this subsection, fix a core with parameters . Let be the span of the functions , for all edges between the uncoloured vertex of and a vertex of colour . Note that is a subspace of the real vector space of all functions . Our goal is to show that is actually the entire space of functions .
First, let us make some more definitions.
Definition 8.6**.**
For an integer , a downward tree of size in is a collection of edges of colours which form a tree containing the uncoloured vertex of as a leaf.
Although formally a downward tree is just a collection of edges, we say that contains a vertex of if this vertex is part of the tree formed by the edges in . For an example of a downward tree, see Figure 1.
Lemma 8.7**.**
For any integer , every downward tree of size in contains the uncoloured vertex and exactly one vertex in each of the colours (and no vertices in the colours ). Furthermore, the vertex of colour is a leaf in the tree . Finally, if and is the unique edge of incident to the vertex of colour , then is not incident to the uncoloured vertex and is a downward tree of size .
Proof.
Since has an edge in each of the colours , it must also have a vertex in each of these colours (recall that in a colour system an edge can only have colour if it is incident to a vertex of colour ). Thus, contains at least one vertex in each of the colours and by definition it also contains the uncoloured vertex. As has only vertices, this establishes the first part of the lemma.
For the second part we need to show that there is only one edge of incident to the vertex of colour . However, note that each edge of incident to the vertex of colour has colour (because the other vertex of each such edge is either uncoloured or has a colour with a number larger than ). As has only one edge of colour , there is indeed only one edge of incident to the vertex of colour .
Finally, for the third part, note that the edge has colour (by the argument above), so is a tree with edges of colours . The edge is not incident to the uncoloured vertex, as otherwise both vertices of would be leaves in which would contradict . In particular, the uncoloured vertex is still a leaf of the tree . Hence is indeed a downward tree of size . â
Lemma 8.8**.**
For every downward tree , we have .
Proof.
We prove the lemma by induction on .
If , then for a single edge . By definition, the uncoloured vertex is a leaf of , so is incident to the uncoloured vertex. Furthermore, has colour , so it is also incident to a vertex of colour . We therefore trivially have , by the definition of .
Now, let us assume that and that the lemma statement holds for . Let be a downward tree of size . Recall that by Lemma 8.7, contains precisely one vertex of colour and this vertex is a leaf in . So let be the unique edge in incident to . Then by the last part of Lemma 8.7, is a downward tree of size . Finally, let be the other vertex of , so that (again by Lemma 8.7) is coloured with one of the colours .
Applying Lemma 8.7 to the downward tree , we can label the vertices of as , such that is the uncoloured vertex of and such that each vertex has colour . Note that is one of the vertices .
Now, since is complete (recall Definition 4.4), we can recursively define vertices in , with colours respectively, such that the following three conditions are satisfied.
- âą
For all , the vertices and are connected by edges of exactly the same shades of colour as the vertices and .
- âą
For all and all vertices of of any of the colours such that , the vertices and are connected by edges of exactly the same shades of the colour of as the vertices and .
- âą
For the index such that , the vertices and are connected by edges of exactly the same shades of the of colour as the vertices and except that there is no edge between and with the shade of .
Informally speaking, these conditions are saying that the shade of the edges between any two of the vertices and the shades of the edges between the vertices and the vertices of colours are the same as the corresponding shades for except that between the vertices and the shade of the edge between and is missing. Here, denotes the vertex for the index such that .
Now, for each edge with endpoints and , for , there exists an edge between and such that has the same shade of colour as . Let be the collection of all these edges together with the unique edge between and (recall that there is only one shade of colour and that the edge between and is the only edge in incident to ). Then forms a tree which is isomorphic to and the corresponding edges are coloured the same way. As is a downward tree of size , this implies that is also a downward tree of size . Furthermore, between any two vertices of there exist edges in exactly the same shades as between the corresponding vertices of . Finally, every vertex in one of the colours has edges in the same shades to the vertices of as to the corresponding vertices of except that the shade of the edge is missing between the vertices and (recall that is an edge between and ).
Thus, for every partial copy of in containing we can form a corresponding partial copy of in containing but not containing , by simply replacing each of the vertices in the image of the partial copy by (and by replacing each edge in the image of the partial copy by an edge of the same shade between the corresponding vertices). This process is bijective and does not change which shades of which colours occur among the edges in the image of the partial copy. It also does not change the weight of the partial copy. Thus, for every , the quantity is equal to the sum of the weights of the -coloured partial copies of in that contain but not . In other words, we have for all .
Since and are downward trees of size , we have by the induction assumption. Hence , as desired. â
Now, note that each downward tree of size contains exactly one edge in each of the colours . For each , let be the shade of the edge of colour in . Then for any tuple with we have , since by definition there cannot be any -coloured partial copies of containing . That is to say, is a (possibly zero) multiple of the indicator function of . Since the set of indicator functions of each span the space of all functions , it suffices to prove the following lemma in order to finish the proof of Lemma 8.5.
Lemma 8.9**.**
For every there exists a downward tree of size in such that .
Proof.
Fix . Recall that is the sum of the weights of all -coloured partial copies of in whose image contains all edges in . Since each partial copy of in has positive weight, it suffices to show that there exists a -coloured partial copy of in which contains some downward tree of size .
Since is complete, we can recursively choose vertices in such that for each , the vertex has colour and for any there are edges of all shades of colour between and . Also, since is connected, and has vertices, we can choose a -edge subtree of . Let be the set of vertices of this subtree and note that . Choose one leaf of and call it . Now, define by mapping to the uncoloured vertex of and mapping the remaining vertices of to in order of decreasing distance to in the tree (this means the vertex of with maximum distance from in the tree will be mapped to , the vertex with second-largest distance to , and so on), where we break ties arbitrarily.
Note that the image of contains one vertex of each colour (the vertices ) and the uncoloured vertex. By the choice of , we can extend to a -coloured partial copy of in (we have already defined the way maps the relevant vertices of , we just need to define the way it maps edges). Let be the image of our subtree of . We can check that is a downward tree of size . â
9 Proofs of Lemmas 6.3 and 7.3
In this section we finally prove Lemmas 6.3 and 7.3, using our new anticoncentration inequality in Section 3 and the functions defined in Section 8. In Section 9.1 we make some definitions and state some auxiliary lemmas. Most importantly, we explain how to define cores in such a way that the functions represent the typical effects of changing the status of certain edges. In Sections 9.2 and 9.3 we prove the auxiliary lemmas, and we put everything together in Section 9.4.
9.1 Preparations
First, we define functions , measuring the change to that results from changing an edge in . In our proof of Lemma 7.3, taking , these functions will correspond to the that appear in the statement of Theorem 3.1. Recall the definition of the graph from Definition 4.5.
Definition 9.1**.**
Let be a colour system with parameters . Then, given two distinct vertices and in , a -tuple , and a graph on the vertex set , let be the number of labelled copies of in the graph which use the edge as well as at least one vertex of each of the colours. (Here, is the graph obtained from by adding the edge if this edge is not already present.)
Next, we need a similar definition for the proof of Lemma 6.3. Recall that Lemma 6.3 concerns functions , which are obtained by averaging functions of the form over . We will be interested in the effects on of adding or removing vertices from the âneighbourhoodâ sets in , which is equivalent to changing the status of edges incident to one of the vertices of colour . We define functions (where is an uncoloured vertex and is a vertex of colour ) to measure the average effects of such changes.
Definition 9.2**.**
Given a restricted colour system with parameters where , an outcome of the random collection of sets in Definition 4.8, a vertex , and a vertex of colour in , let be the function given by
[TABLE]
for all . Here, is a random graph on the vertex set .
Now, we want to show that the typical values of the and can be expressed in terms of functions . First, we consider , for the proof of Lemma 6.3. It will suffice to restrict our attention to the cases where comes from a very ârichâ subset of the uncoloured vertices.
Definition 9.3**.**
For a restricted colour system with parameters , let be the set of all uncoloured vertices in which are connected to all vertices of the colours in all possible shades of these colours.
When a general position assumption is satisfied, has linear size (by Lemma 5.1), as follows.
Fact 9.4**.**
Fix integers . If is an essentially -general restricted colour system of order which has parameters then .
We remind the reader that (as in the rest of this paper) the asymptotics in 9.4 are as , while and the parameters of the restricted colour system are treated as fixed constants for all asymptotic notation. Now, the relevant core for Lemma 6.3 is as follows.
Definition 9.5**.**
Given a restricted colour system with parameters , we can obtain a core with parameters as follows. Consider all coloured vertices of together with one additional uncoloured vertex which we connect to all coloured vertices by edges in all possible shades. We call the core of the restricted colour system .
Note that if a restricted colour system is complete, then its core is also complete (recall that being complete only depends on the edges between the coloured vertices). The following lemma gives the connection between the functions and the functions . It will be proved in Section 9.2.
Lemma 9.6**.**
Fix integers and . Let be an essentially weakly -general restricted colour system of order with parameters . Furthermore, let be an outcome of the random collection of sets in Definition 4.8, let and let be a vertex of colour in . Finally, let be the core of the restricted colour system , and let be the unique edge from to the uncoloured vertex in (recall that is a vertex of colour in , and that the colour only has one shade). Then, if the colour system is -general, we have
[TABLE]
Next, we turn to the functions , for the proof of Lemma 7.3. For this, we consider cores of a different type.
Definition 9.7**.**
Given a colour system with parameters , define its extended core to be the core with parameters obtained as follows. Start with all coloured vertices of . Now, for each possible choice of subsets for each and each vertex of colour , add a vertex of colour which is connected to all the vertices of colours with edges of exactly the shades given by the set . Finally, add one uncoloured vertex and connect it to all coloured vertices by edges in all possible shades (including exactly one shade of colour ).
Note that in Definition 9.7, there are precisely different choices for all the subsets . Thus, vertices of colour get added and the resulting core indeed has parameters . Furthermore, note that if the colour system is complete, then its extended core is a complete core.
In a similar way to Lemma 9.6, for the proof of Lemma 7.3 it will suffice to restrict our attention to those where and belong to certain special sets of uncoloured vertices.
Definition 9.8**.**
Let be a colour system with parameters and let be the extended core of . Let be the set of edges connecting the uncoloured vertex in to some vertex of colour in (such an edge is uniquely determined by because colour only has one shade). For each such , we define the subset as follows. Let consist of all those uncoloured vertices in such that is connected to all vertices of of colours in precisely the same shades of the corresponding colours in which the vertex is connected to these vertices in . Also, let be the set of all uncoloured vertices in which are connected to all vertices of of colours in all possible shades of these colours.
Note that is a special case of , for the edge connecting the uncoloured vertex of to the unique vertex of colour in which is connected to all vertices of colours in all possible shades of these colours. Also note that the sets , for , form a partition of . We will need a counterpart of 9.4 (again a simple consequence of Lemma 5.1): when a general position assumption is satisfied, each has linear size, as follows.
Fact 9.9**.**
Fix integers . Let be a weakly -general colour system of order with parameters , and let be the extended core of . Then for every , we have . In particular, .
The next lemma will be proved in Section 9.3.
Lemma 9.10**.**
Fix integers and . Let be a weakly -general colour system of order with parameters , and let be the extended core of . Consider an edge , and consider the sets and as defined as in Definition 9.8. Then for any distinct vertices and the following holds. If we choose a random graph on the vertex set , then with probability we have
[TABLE]
9.2 Proof of Lemma 9.6
In this subsection we prove Lemma 9.6. Throughout the subsection, fix integers and (in order to prove Lemma 9.6 for these values). For all asymptotic notation, these fixed values are treated as constants.
As in the statement of Lemma 9.6, let be an essentially weakly -general restricted colour system of order with parameters . Let be an outcome of the random collection of sets in Definition 4.8 such that is -general, let and let be a vertex of colour in . Finally, let be the core of the restricted colour system , and let be the unique edge from to the uncoloured vertex in . We need to prove that
[TABLE]
Now, let us define slightly modified versions of and the , that are easier to work with. For any outcome of the random graph and any , let be the number of labelled copies of in the graph which use the edge and which use exactly one vertex of each of the colours (for we considered the number of copies of which use at least one vertex of each colour). Then, we define analoguously to :
[TABLE]
for all .
Note that for any outcome of the random graph and any the difference is precisely the number of labelled copies of in the graph which use the edge as well as at least one vertex of each of the colours and which use at least two vertices of the same colour. Each such labelled copy has to use at least of the coloured vertices and also the vertex . Hence it can use at most vertices in . Thus, the number of such labelled copies is always at most . It follows that
[TABLE]
So, to prove Lemma 9.6 it suffices to prove the following lemma.
Lemma 9.11**.**
We have
[TABLE]
Proof.
We need to show that for every we have
[TABLE]
So let us fix some . For every let us refer to shade of colour as the desired shade of colour .
Recall that is the expected number of labelled copies of in the graph which use exactly one vertex of each of the colours and use the edge . We organise these copies by how they interact with the coloured vertices, as follows.
Consider any graph homomorphism , such that is a subset of vertices of and such that the image of contains exactly one vertex of each of the colours and the edge (let be the set of all such homomorphisms). Let be the expected number of labelled copies of in the graph that extend by mapping the vertices in into (where the expectation is taken over the random choice of ). Then we have
[TABLE]
Now, since is -general, we can estimate the as follows.
Claim 9.12**.**
For each as above, we have
[TABLE]
Proof.
Let be the set of those vertices such that is a coloured vertex (i.e. ). So, . Recall that is the expected number of labelled copies of in the graph that extend by mapping the vertices in into . For every vertex , let be the set of possible choices for the image of that are compatible with the map on . More precisely, is the set of vertices such that for every neighbour of , the vertex is connected to in the desired shade of the colour of the vertex . Let be the number of -tuples in whose vertices are distinct (that is, the number of ways to choose a distinct vertex from each ). Then , and
[TABLE]
Indeed, if we choose possible images for all the vertices (there are such choices), then each of the edges of inside needs to be mapped to an edge of and the probability for this to happen is .
Now, the sizes of the are dictated by our assumption that is -general. Fix a vertex and let be its neighbours in . Let be the neighbourhoods of the vertices in in the desired shades of the colours of , respectively. Then the set of possible choices for the image of is . So, differs from \mathopen{}\mathclose{{}\left|N_{1}\cap\dots\cap N_{k}}\right| by at most 1. Now, if we consider all the neighbourhoods in of all vertices of colours in in all the respective shades of these colours, then these are subsets of the set in -general position (this is because is by assumption a -general colour system). Thus, by Lemma 5.1 we have
[TABLE]
Recall that , and that was the number of neighbours in of in , so
[TABLE]
or equivalently
[TABLE]
Finally, observe that , plus the sum of all the , for , is equal to . Indeed, since and intersect in only one vertex , every edge of is either between two vertices of , two vertices of , or between a vertex of and a vertex of . From Equation 9.3 we therefore conclude that
[TABLE]
which is equivalent to the desired bound. â
Now, the sum in Equation 9.2 is only over choices of , so Claim 9.12 implies that
[TABLE]
Finally, recall that is the set of homomorphisms of the form , with , such that the image of contains exactly one vertex of each of the colours and the edge . The core of the restricted colour system was defined (in Definition 9.5) in such a way that there is a bijective correspondence between and the set of -coloured partial copies of in which contain the edge . Recall that was defined (in Definition 8.4) as the sum of the weights of all -coloured partial copies whose image contains , and the weight of a partial copy was defined to be . So,
[TABLE]
The desired bound Equation 9.1 follows. â
9.3 Proof of Lemma 9.10
In this subsection we deduce Lemma 9.10 from Lemma 9.6. Throughout the subsection, fix integers and (in order to prove Lemma 9.10 for these values). For all asymptotic notation, these fixed values are treated as constants.
Let be a weakly -general colour system of order with parameters , let be the extended core of , and consider some . Fix distinct vertices and . We need to show that for a random graph , with probability we have
[TABLE]
for all . For the rest of the proof fix some : we will show that Equation 9.4 holds with probability (then the desired result will follow, taking a union bound over all choices of ).
By 9.9, each of the disjoint sets has size . Let be a set containing one representative from each , taking , but taking some vertex other than in . If we imagine that the vertices of are coloured with colour , then, by our choice of , the coloured vertices of together with form a colour system which looks the same as the extended core of except that the uncoloured vertex of is missing.
Now, for the rest of the proof, we condition on some outcome of the induced subgraph on the vertices in . To apply Lemma 9.6, we define a restricted colour system of order with parameters by starting with , colouring the vertices in with colour , and including all edges of our conditioned outcome of in a single shade of colour . By construction, the core of the restricted colour system is almost isomorphic to ; the only difference is that has the edges of between the vertices of colour , whereas has no edges of colour . (To be precise, there is a colour/shade-preserving graph isomorphism between and ).
Since we chose such that and , we have (that is, is uncoloured in ), and has colour in . Recall that , meaning that is connected to all vertices of colours in all possible shades of these colours. This implies . Now, let be the unique edge in the core between and the uncoloured vertex of . As , this edge corresponds to the edge in under the isomorphism in the previous paragraph. Note that the functions do not actually depend on the edges between the vertices of colour in (since the partial copies of in use exactly one vertex of colour ). Thus, we have .
We have conditioned on an outcome of . For each , let be the (random) set of neighbours of in , and let be the induced subgraph on . Then, with we have in the notation of Definition 4.8. It follows that , because every labelled copy of using the edge as well as at least one vertex of each of the colours automatically also uses a vertex of colour (namely ). Thus, Equation 9.4 is equivalent to the inequality
[TABLE]
where , and is a collection of random sets with respect to the restricted colour system as in Definition 4.8.
Note that is essentially weakly -general, due to the way it was defined in terms of . So, by Lemma 5.5, is -general with probability at least , and if is -general then by Lemma 9.6 we have . So, to conclude the proof of Lemma 9.10, it suffices to prove the following claim.
Claim 9.13**.**
With probability we have
[TABLE]
Proof.
Condition on a fixed outcome of (so that only remains random). Recall that by Definition 9.2 we have . Consider the vertex-exposure martingale for (with respect to ), where we fix an ordering of (ending with ) and at each step we consider the next vertex in our ordering and expose all the edges of incident to that vertex which have not yet been exposed. Changing the status of edges adjacent to a single vertex in changes the value of by at most . This is due to the fact that there can be at most different labelled copies of in the -vertex graph which use the edge as well as at least one vertex of each of the colours and the exposed vertex (as both and the exposed vertex are uncoloured). Thus, by the AzumaâHoeffding inequality the probability that Equation 9.6 fails to hold is at most
[TABLE]
as desired. â
9.4 Putting everything together
Proof of Lemma 7.3.
Recall that the statement of Lemma 7.3 is for fixed integers and (which were fixed throughout Section 7), and recall that . Let and be as in the statement of the lemma, and let be the extended core of (which is a complete core). Let , so that the random choice of can be encoded by a Bernoulli sequence , with one random bit for each of the possible edges of . Abusing notation slightly, we identify the integers with pairs of vertices in , so that we may write to indicate the random bit that encodes the presence of the edge .
Now, abusing notation, we index the coordinates of by tuples in (so that we may talk about the -coordinate of a vector in ). Let be the vector-valued function defined such that, for corresponding to a graph , the -coordinate of is . With this definition, and the notation of Theorem 3.1, the random vector corresponds to the function .
The plan is to now apply Theorem 3.1 with and and with the taking the role of the vectors . For each edge , let be the vector corresponding to the function , and let be the vector corresponding to the function . By 9.9, there is some such that for each edge , there are pairs of vertices with . Let be the set of these pairs (observe that all the are disjoint). By Lemma 9.10, for each we have \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\Delta_{\{u,v\}}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-s\boldsymbol{\gamma}_{e}}\right\|_{\infty}\geq r}\right)\leq n^{-\omega(1)} for and some . Note that , and by Lemma 8.5, the vectors span . We can now apply Theorem 3.1 to obtain
[TABLE]
(The implied constants in the above asymptotic notation may a priori depend on , but note that there are only finitely many possibilities for a core with parameters ). Finally, to conclude the proof we recall that \mathopen{}\mathclose{{}\left\|\psi_{H}(\mathcal{G},G_{0},\cdot)-\lambda}\right\|_{\infty}=\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-\boldsymbol{x}}\right\|_{\infty} and observe that for large . â
Proof of Lemma 6.3.
This proof is very similar to the proof of Lemma 7.3. Recall that the statement of Lemma 6.3 is for fixed integers and (which were fixed throughout Section 6), and recall that . Let and be as in the statement of the lemma, and let be the core of . Let , so that a random choice of as in Definition 4.8 can be encoded by a Bernoulli sequence , with one random bit for each potential element in each . Abusing notation slightly, we identify with ordered pairs of vertices: for and a vertex of colour we write for the random bit that encodes the presence of in .
Let be the vector-valued function (with coordinates indexed by ) defined such that, for corresponding to an outcome of , the -coordinate of is . Then, corresponds to the function . For each , let be the vector corresponding to , and let be the vector corresponding to .
By 9.4, there is some such that . For each edge between the uncoloured vertex of and some vertex of colour , let . By Lemma 5.5 the colour system is -general with probability , in which case, by Lemma 9.6, for each we have \mathopen{}\mathclose{{}\left\|\Delta_{(u,v)}\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-s\boldsymbol{\gamma}_{e}}\right\|_{\infty}\leq r for and some . Also, by Lemma 8.5, the vectors span .
We can now apply Theorem 3.1 to obtain \Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-\boldsymbol{x}}\right\|_{\infty}<r\sqrt{N\log N}}\right)\leq n^{-T/2+o(1)}. Finally, to conclude the proof we observe that for large , and recall that \mathopen{}\mathclose{{}\left\|\mu_{\mathcal{G},\mathcal{S}}-\lambda}\right\|_{\infty}=\mathopen{}\mathclose{{}\left\|\boldsymbol{f}\mathopen{}\mathclose{{}\left(\boldsymbol{\xi}}\right)-\boldsymbol{x}}\right\|_{\infty}. â
10 Concluding remarks
We have proved that for connected and constant , we have . There are several interesting directions for future research. Most obviously, Conjecture 1.1 remains open: for connected we are still a factor of away from an optimal bound, and for disconnected we do not even have a bound that improves as grows (the best general bound is \Pr\mathopen{}\mathclose{{}\left(X_{H}=x}\right)\leq\Pr\mathopen{}\mathclose{{}\left(\mathopen{}\mathclose{{}\left|X_{H}-x}\right|\leq n^{v\mathopen{}\mathclose{{}\left(H}\right)-2}}\right)=O\mathopen{}\mathclose{{}\left(1/n}\right), as we mentioned in the introduction). It seems that the ideas in this paper are robust enough to give certain nontrivial bounds (in terms of the size of the largest component of ) even in the disconnected case, but we have not explored this further.
For certain graphs , a possible route to a proof of Conjecture 1.1 might be via a local central limit theorem, which one might hope to prove by extending the methods of Gilmer and Kopparty [20], and Berkowitz [6, 7]. Basically, this involves carefully estimating the characteristic function , using different arguments for different ranges of . We remark that is small if the distribution of is not too biased mod , which seems comparable to anticoncentration of at âscaleâ . So, we wonder whether the ideas in this paper might be helpful for estimating : recall that our argument proceeds by breaking up into a sum of many random variables that fluctuate at different scales. However, we emphasise that local central limit theorems do not seem to be the right path to a proof of Conjecture 1.1 in its full generality: for example, if is a disjoint union of an edge and a 2-edge path, then the probability that is odd is substantially different from the probability that it is even (see [16]), meaning that does not obey a local central limit theorem.
Also, let be the number of (possibly non-injective) homomorphisms from into . This random variable is very closely related to , and we remark that with very minimal changes, one can modify our proof of Theorem 1.2 to prove the corresponding theorem for , when is connected. Interestingly, the homomorphism-counting analogue of Conjecture 1.1 fails dramatically in general: if is the disjoint union of two copies of a graph , then , meaning that has the same point probabilities as . This means that any proof of Conjecture 1.1 must be sensitive to the difference between subgraph counts and homomorphism counts.
It would also be interesting to consider the âsparseâ regime where is allowed to decay with . For example, it is known that if is strictly balanced (see for example [23, Section 3.2]) and is such that , then . Could it be that under these conditions we also have that ?
As mentioned in [19] it may also be interesting to study anticoncentration of the number of induced copies of a subgraph in a random graph \mathbb{G}\mathopen{}\mathclose{{}\left(n,p}\right). (This question was also raised by Meka, Nguyen and Vu [30]). The natural analogue of Conjecture 1.1 is that for a fixed graph and fixed p\in\mathopen{}\mathclose{{}\left(0,1}\right), we have
[TABLE]
We remark that the behaviour of \sqrt{\operatorname{Var}\mathopen{}\mathclose{{}\left(X_{H}^{\prime}}\right)} is not entirely trivial: for most values of it has order , but when is exactly equal to the edge-density of it may have order or (see [23, Theorem 6.42]).
Finally, it would be interesting to prove similar anticoncentration results in other combinatorial settings. One important example is random subsets of the integers (or other groups): for instance, what can we say about anticoncentration of the number of -term arithmetic progressions in a random subset of ? Arithmetic configuration counts have been an interesting analogue to subgraph counts in a number of other settings, for example in the study of large deviations (both fall in the framework of nonlinear large deviations initiated by Chatterjee and Dembo [12]; see for example [22] and the references therein). Another interesting direction of research would be to consider subgraph counts in random -uniform hypergraphs, or for other random graph models (for example, the uniform distribution on graphs with a fixed set of vertices and exactly edges).
Remark added in proof. While this paper was under review, Sah and Sahwney [36] proved Conjecture 1.1 for connected (via a local limit theorem) and disproved Conjecture 1.1 in general.
Acknowledgements. We thank the referees for their careful reading, and for many helpful comments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Alon and J. H. Spencer, The probabilistic method , fourth ed., Wiley Series in Discrete Mathematics and Optimization, John Wiley & Sons, Inc., Hoboken, NJ, 2016.
- 2[2] F. Augeri, Nonlinear large deviation bounds with applications to traces of Wigner matrices and cycles counts in Erdös-Renyi graphs , Ann. Probab., to appear.
- 3[3] A. D. Barbour, Poisson convergence and random graphs , Math. Proc. Cambridge Philos. Soc. 92 (1982), no. 2, 349â359.
- 4[4] A. D. Barbour, M. KaroĆski, and A. RuciĆski, A central limit theorem for decomposable random variables with applications to random graphs , J. Combin. Theory Ser. B 47 (1989), no. 2, 125â145.
- 5[5] A. Basak and R. Basu, Upper tail large deviations of regular subgraph counts in ErdĆs-RĂ©nyi graphs in the full localized regime , ar Xiv preprint ar Xiv:1912.11410 (2019).
- 6[6] R. Berkowitz, A quantitative local limit theorem for triangles in random graphs , ar Xiv preprint ar Xiv:1610.01281 (2016).
- 7[7] R. Berkowitz, A local limit theorem for cliques in G â ( n , p ) đș đ đ G(n,p) , ar Xiv preprint ar Xiv:1811.03527 (2018).
- 8[8] B. B. Bhattacharya, S. Ganguly, E. Lubetzky, and Y. Zhao, Upper tails and independence polynomials in random graphs , Adv. Math. 319 (2017), 313â347.
