Upper tail bounds for Stars
Matas \v{S}ileikis, Lutz Warnke

TL;DR
This paper derives nearly optimal exponential bounds for the upper tail probabilities of the number of r-armed stars in a binomial random graph, extending previous results to a broader range of deviations.
Contribution
It establishes the best possible exponential bounds for upper tail probabilities of star counts in G_{n,p}, solving a problem posed by Janson and Rucinski and confirming a conjecture by DeMarco and Kahn.
Findings
Derived exponential bounds are tight up to constant factors.
Extended upper tail analysis to deviations larger than constant .
Confirmed conjecture and solved open problem for star counts.
Abstract
For r \ge 2, let X be the number of r-armed stars K_{1,r} in the binomial random graph G_{n,p}. We study the upper tail \Pr(X \ge (1+\epsilon)\E X), and establish exponential bounds which are best possible up to constant factors in the exponent (for the special case of stars K_{1,r} this solves a problem of Janson and Rucinski, and confirms a conjecture by DeMarco and Kahn). In contrast to the widely accepted standard for the upper tail problem, we do not restrict our attention to constant \epsilon, but also allow for \epsilon \ge n^{-\alpha} deviations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Upper tail bounds for Stars
Matas Šileikis and Lutz Warnke Department of Theoretical Computer Science, Institute of Computer Science of the Czech Academy of Sciences, 182 07 Prague, Czech Republic. E-mail: [email protected]. With institutional support RVO:67985807. Research supported by the Czech Science Foundation, grant number GJ16-07822Y.School of Mathematics, Georgia Institute of Technology, Atlanta GA 30332, USA. E-mail: [email protected]. Research partially supported by NSF Grant DMS-1703516 and a Sloan Research Fellowship. Part of the work was done while the author was a member of the Department of Pure Mathematics and Mathematical Statistics, University of Cambridge.
(29 January 2019)
Abstract
For , let be the number of -armed stars in the binomial random graph . We study the upper tail , and establish exponential bounds which are best possible up to constant factors in the exponent (for the special case of stars this solves a problem of Janson and Ruciński, and confirms a conjecture by DeMarco and Kahn). In contrast to the widely accepted standard for the upper tail problem, we do not restrict our attention to constant , but also allow for deviations.
1 Introduction
The study of (the distribution of) small subgraphs in the binomial random graph is one of the most fundamental and influential problems in the theory of random graphs. Starting with the seminal work of Erdős and Rényi [11] from 1960, the early results for the number of copies of in concerned the threshold of appearance (i.e., when ) and the range of edge-probabilities for which is asymptotically normal; these basic features were eventually resolved in the 1980s by Bollobás [2] and Ruciński [25]. Later the focus changed to finer details of the distribution of , and the lower tail was studied intensively in the late 1980s (often for the special case ). This led to the discovery of Janson’s inequality [13, 14, 24], which gives exponential bounds for that are best possible up to constant factors in the exponent (cf. the recent work of Janson and Warnke [20]).
Since the early 1990s the ‘infamous’ upper tail has remained an important challenge, providing a well-known testbed for concentration inequalities (see, e.g., [16]). After polynomial bounds around 1990 by Spencer [29] and exponential bounds in the late 1990s via the Kim–Vu polynomial concentration method [21, 30], in 2002 Janson, Oleszkiewicz and Ruciński [17] obtained a breakthrough: via a moment based method they obtained exponential estimates for that, for constant , are best possible up to logarithmic factors in the exponent (see also [9, 19] for extensions to random hypergraphs, and arithmetic progressions in random subsets of integers). The upper tail problem of closing the aforementioned logarithmic gap has remained open during the last decade, and only recently this has been settled for cliques by DeMarco and Kahn [6, 7] (see also Chatterjee [3] for ), and for arithmetic progressions by Warnke [31]. Modern large deviation theory also gives partial results [4, 23, 1, 8] for large edge-probabilities (this restriction sidesteps some difficulties of the upper tail problem).
In this paper we solve the upper tail problem for -armed stars , and as a conceptual novelty we will also allow to depend on (i.e., do not restrict our attention to constant , as usual). The casual reader might suspect that tail estimates for -armed stars are essentially trivial, but this is only true for (where since ). To put this into context, Janson, Oleszkiewicz and Ruciński [17] proved that for -regular graphs , such as cliques , the upper tail satisfies
[TABLE]
where the subscripts in and indicate that the implicit constants may depend on and . They also highlighted (with ) as key example where the form of the exponent is more complicated, i.e, has different expressions for different ranges of . This surprising intricacy is further manifested by the history of the infamous upper tail problem. Namely, Vu [30] argued in 2000 that his general results were essentially unimprovable due to -armed stars, for which he obtained bounds of the form
[TABLE]
However, Janson, Oleszkiewicz and Rucinski [17] later discovered that the upper tail behavior of is more delicate (the lower bound in (1) is not always correct), and obtained bounds of the more involved form
[TABLE]
In words, for stars the form of the upper tail changes around , which is an intriguing phenomenon (that does not occur for cliques). In fact, a recent conjecture of DeMarco and Kahn [7, 28] for general suggests that in (2) the ‘correct’ exponent involves yet another term , see (3). However, despite some partial results [26, 27, 33], the quest for matching bounds in (1)–(2) remained open.
1.1 Main results
Our first basic result settles the upper tail problem of -armed stars for constant , by closing the existing gap in the exponent for all . In particular, (3) below confirms111Using Corollary 1.8 in [17] and the discussion of Remark 8.3 in [17] it is not difficult to check that the special case of Conjecture 10.1 in [7] indeed reduces to (3) with ; see also equation (4.27) in [27] and Remark 2 in [26]. Conjecture 10.1 of DeMarco and Kahn [7] in the special case . For subgraph counts this is the first example of a sharp upper tail estimate where, for constant , the form of the exponent undergoes multiple phases (i.e., has more more than two different expressions for different ranges of ).
Theorem 1** (Upper tail problem for constant ).**
Given , let be the number of copies of in . Set . For and satisfying we have
[TABLE]
Note that the assumption is necessary (since is impossible), and that the assumption is natural (since otherwise holds). We now motivate the intricate form of the exponent in (3) for . First, Poisson approximation heuristics suggest that for small edge-probabilities . Second, it turns out that appropriately clustered222For example, complete bipartite graphs with suitable and suffice: they contain stars and edges; see Lemma 14 for more details. edges suffice to create copies of , which implies . Intuitively, Theorem 1 confirms that the more likely of these two mechanisms (the one with larger probability) controls the upper tail behaviour for constant .
Our second result determines the correct dependence of the stars upper tail on , up to constant factors in the exponent (this contrasts Theorem 1 above, where the implicit constants may depend on ). In particular, (4) below solves Problem 6.1 of Janson and Ruciński [18] in the special case . For subgraph counts this is the first example where, for bounded away from one, the order of the large deviation rate function is determined for of form (the assumption is natural, since it ensures that we are dealing with exponentially small probabilities).
Theorem 2** (Upper tail problem for ).**
Given , let be the number of copies of in . Set , , and . Given there is such that, for and satisfying and , we have
[TABLE]
with
[TABLE]
Remark 3**.**
The variance satisfies ; see, e.g., Lemma 3.5 in [15]. Furthermore, if holds, then in (5) we can replace by ; see Lemma 12.
Conjecture 4** (Correct upper tail behaviour).**
Theorem 2 remains valid without the assumption .
We now motivate the somewhat unusual form of the exponent in (4). First, normal approximation heuristics333The same normal heuristic suggests that in (3) we should perhaps have used instead of , but it turns out that then the term would only matter for (i.e., determine the minimum) in a range of where holds. suggest that for very small , and this sub-Gaussian tail is consistent with the term in (5) since as (the function is well-known from Chernoff bounds). Second, in we usually expect to have at least copies of , say, so enforcing extra copies via appropriately clustered444As before, complete bipartite graphs with suitable and suffice (see Lemma 14). edges should thus be enough to give a total of copies of ; this heuristic loosely suggests . Intuitively, Conjecture 4 predicts that the form of the upper tail is indeed determined by either sub-Gaussian or ‘clustered’ behaviour, and Theorem 2 confirms this for .
Our third result approaches the upper tail problem from a conceptually slightly different perspective, studying for general deviations (this contrasts Theorem 1 and 2 above, where we focus on the large deviations range and then put restrictions on ). For subgraph counts, inequality (6) below is the first example where, for moderately large edge-probabilities , the order of is completely resolved for all exponentially small deviations (where is the natural target assumption). We complement this result with inequality (7) below, which is the first example where the order of is resolved for nearly all deviations where the ‘clustered’ behaviour determines the exponent (here is the natural target assumption for ; see (5), Remark 3, and Conjecture 4).
Theorem 5** (General upper tail bounds: moderate deviations and clustered regime).**
Given , let be the number of copies of in . Set and . Given , then the following holds whenever and .
- (i)
If and , then
[TABLE] 2. (ii)
If and satisfies , then
[TABLE]
By Remark 3, inequalities (6)–(7) provide further evidence for Conjecture 4 (and verify it for ).
1.2 Some comments
The main focus of this paper are upper bounds on the upper tail . Developing [31, 33], here our high-level proof strategy is based on the idea that (after ignoring certain ‘bad’ events with negligible probabilities) using combinatorial arguments we can find a ‘well-behaved’ subgraph in the sense that (i) the number of stars in and are approximately the same (differ by at most , say), and (ii) the maximum degree of is ‘not too large’ (which intuitively helps for showing concentration of stars). Using modern Chernoff-like upper tail bounds, we then show that it is very unlikely to have a ‘bad’ subgraph with ‘not too large’ maximum degree and ‘many’ stars (at least many, say). Putting things together, the punchline is then that we can only have many stars if one of the discussed unlikely ‘bad’ events occur, which (after some technical work) eventually gives the desired upper bounds on the upper tail ; see Section 2 for more details.
Finally, let us briefly compare our upper tail results for stars with very recent results from the large deviation theory literature, which are spearheaded by Chatterjee, Dembo, Lubetzky, Varadhan, Zhao, and many others (see, e.g., [5, 22, 4, 23, 10, 1, 8]). For general , these aim at determining the asymptotics of for constant and large edge-probabilities of form or . For stars , inequality (4) from Theorem 2 is weaker in the sense that it only determines up to constant factors, but it is stronger in the sense that it covers a much wider range of the parameters, including and all of interest. Obtaining such tail estimates with increased ranges of applicability is useful for combinatorial applications, where one is usually ‘willing to give up a little bit on the tail’, in particular on the ‘inessential numerical constants’ in the exponent (see [30, 18]). Furthermore, estimates of form (6)–(7) are also quite satisfactory from a concentration inequality perspective. Overall, we hope that our results stimulate more research into such estimates for other graphs .
1.3 Organization
In Section 2 we prove the upper bounds on the upper tail from Theorem 1, 2, and 5 (and discuss a simple extension). The corresponding (fairly routine) lower bounds are then established in Appendix A.
2 Upper bounds on the upper tail
In this section we establish the upper bounds on the upper tail from Theorem 1, 2, and 5. Our core argument has two strands. In the first combinatorial part we iteratively decrease the maximum degree of the random graph by edge-deletion (the idea is to remove large stars with from ) until the final graph has sufficiently low maximum degree, say at most . This degree bound allows us to estimate the number of stars in via a ‘well-behaved’ auxiliary random variable . Taking into account the number of stars which are removed when passing from to , this allows us (by means of a technical event ) to approximate the number of copies of in using and several further auxiliary random variables (which intuitively bound the number of in ). In the second probabilistic part we then estimate the upper tails of these auxiliary variables using a concentration inequality of Warnke [31] and ad-hoc arguments (exploiting the careful definitions of the variables and given in Section 2.1). Putting things together, the core argument then proceeds roughly as follows: by the combinatorial part can only happen if at least one of the auxiliary variables or is ‘large’, and by the probabilistic part the probability of this ‘bad’ event is at most the desired ‘correct’ upper tail probability (for suitable choices of the degree constraint and other parameters).
In Section 2.1 we first illustrate this argument for the simpler setup of Theorem 1, and in Section 2.2 we then extend the argument to the more precise tail estimates of Theorem 2 and 5. Finally, in Section 2.3 we also briefly discuss a straightforward extension (to a certain sum of iid variables).
2.1 Core argument for Theorem 1
We start by introducing the main random variables and events for Theorem 1 (as we shall see, their careful definitions will facilitate the interplay between the combinatorial and probabilistic parts of our argument). For , let denote the maximum number of copies of in any subgraph with maximum degree at most . For , let denote the maximum size of any collection of edge-disjoint in . For let denote the ‘technical’ event that
[TABLE]
where we tacitly used the following convenient parametrization:
[TABLE]
(In this subsection we shall only use ; working with general is convenient for the later extensions.)
The following combinatorial lemma is at the heart of our argument, and it intuitively states that whenever the event holds. Its proof is inspired by ideas developed in [31, 33], but contains several new ideas. For example, instead of iteratively sparsifying an auxiliary hypergraph (which encodes the edge-sets of all stars in ) we here iteratively sparsify the random graph itself. Furthermore, in order to obtain the correct tail behaviour, in inequality (8) we need to work with instead of the simpler choice suggested by [31] (we achieve this by adding an extra degree bound to the argument, bounding the initial maximum degree by instead of just ).
Lemma 6**.**
Given and , the event implies .
The lower bound of Lemma 6 is trivial. For the upper bound the idea is to iteratively decrease the maximum degree of , yielding . By bounding the number of which are removed when passing from to , this eventually allows us to estimate the total number of .
Proof of Lemma 6.
Define . Let be the smallest integer with . We set and inductively construct . Given , , let be a maximal set of edge-disjoint collection of stars . We remove all edges from which are incident to a centre vertex of some star in , and denote the resulting graph by .
Writing for the maximum degree of , we claim that for all . For we use a case distinction. If , then trivially . Otherwise , in which case (8) entails , so contains no , and follows. Further considering with , we note that by construction, because otherwise we could add another to (contradicting maximality).
With in hand, we now count the total number of copies of in . Note that, given an edge of with , we can construct any in containing by first selecting a centre vertex and then additional neighbours of . Hence in any edge is contained in at most copies of . Recalling the definition of , note that when, passing from to , we remove at most edges. So, since contains at most copies of , using (8) and it follows that
[TABLE]
Recalling and , using , and we infer
[TABLE]
which completes the proof. ∎
Applying Lemma 6 with , in the probabilistic part of the argument it remains to estimate and . We shall exploit the maximum degree constraint of via the following upper tail inequality of Warnke [31], which extends classical Chernoff bounds to random variables with ‘well-behaved dependencies’ (and allows us to go beyond the method of typical bounded differences [32]).
Theorem 7** (Corollary of [31, Theorem 9]).**
Let be a finite family of independent random variables with . Given a family of subsets of , consider random variables with , and suppose . Define , where the maximum is taken over all with . Set . Then, for all ,
[TABLE]
The main observation is that, in every subgraph with maximum degree at most , any star shares edges with other stars. For this allows us to routinely apply Theorem 7 with Lipschitz-like parameter , making inequality (13) plausible. For Theorem 1 the crux is that our choice of will ensure , so (13) suggests that fails with probability at most .
Corollary 8**.**
For all , and we have
[TABLE]
Proof.
Let contain all edge-subsets of that are isomorphic to . Writing , there is a subgraph with maximum degree at most such that for . Given , we construct all edge-intersecting stars as in the proof of Lemma 6, and infer
[TABLE]
It follows that , where is defined as in Theorem 7 with . It is well-known (and easy to check by calculus) that for we have
[TABLE]
Putting things together, using Theorem 7 and (15) it follows that
[TABLE]
which completes the proof of (13) by choice of (see (14) above). ∎
We shall estimate via a union bound argument and the following upper tail estimate for . The technical assumption (17) intuitively ensures that vertices with degree at least are unlikely. For Theorem 1 the crux is that our choice of will also ensure , so applications of inequality (18) with suggest that and thus (8) fails with probability at most , say.
Lemma 9**.**
For all , , and satisfying
[TABLE]
the following holds. For all we have
[TABLE]
Proof.
As for all integers , by exploiting the disjointness condition of we infer
[TABLE]
As the function is decreasing for , and (17) implies , we deduce
[TABLE]
Plugging this into (19) readily establishes inequality (18), since trivially when . ∎
For the proof of the upper bound of Theorem 1 it remains to pick suitable , i.e., which satisfies the technical assumption (17) and yields the ‘correct’ exponent in (13) and suitable applications of (18).
Proof of the upper bound in (3) of Theorem 1.
For concreteness, define and , as well as
[TABLE]
For later reference, we record that there is a constant such that, for ,
[TABLE]
By Lemma 6, the upper tail of the number of -copies satisfies
[TABLE]
Gearing up to bound via Lemma 9, using and inequality (20) together with the bound (as ) it follows that
[TABLE]
where here and below we shall always tacitly assume whenever necessary. Since the above calculation also gives , together with it follows that
[TABLE]
Applying a union bound argument, using estimates (18), , and (22) it follows that
[TABLE]
Recalling (21) and the definition of , by applying Corollary 8 with it follows that there is a constant and suitable parameters such that
[TABLE]
We find the above upper tail estimate very satisfactory, but in the literature it has become standard to suppress multiplicative factors such as in (24), which is straightforward when holds (rescaling the exponent by a factor of , say). In the remaining case Markov’s inequality gives
[TABLE]
Finally, noting then establishes the upper bound in (3). ∎
2.2 Extension of the argument to Theorem 2 and 5
We now extend the arguments from Section 2.1 to the upper bounds of Theorem 2 and 5. To obtain sub-Gausssian decay in the exponent of tail-inequality (13) for , in view of the well-known variance estimate from Remark 3 we here would like to pick for some range of . However this choice causes a major problem:555For another problem is that the technical assumption (17) of Lemma 9 then breaks when is close to one, which partially explains why in the upcoming Theorem 11 we shall exclude fairly small when . in the key estimate (22) we can no longer win an extra log-factor (via ) when we bound the variables using (18) from Lemma 9. Our strategy for overcoming this obstacle is to refine the technical event , by enforcing different upper bounds on when is small (so that in the probabilistic arguments we automatically win an extra logarithmic factor, without destroying the combinatorial counting arguments from Lemma 6).
Turning to the details, for let denote the ‘technical’ event that
[TABLE]
where, in addition to the parameters and from (9), we tacitly used
[TABLE]
Lemma 10**.**
Given and , the event implies .
Proof.
The proof of Lemma 6 carries over, except for the final inequalities (10)–(11) that bound from above. Recalling that , by mimicking the argument leading to (10) we here obtain
[TABLE]
Recalling and , using it then follows similarly to (10)–(11) that
[TABLE]
which completes the proof. ∎
We are now ready to prove the following slightly more general upper tail estimate for the number of -copies in , which (as we shall see) implies the upper bounds in Theorems 2 and 5.
Theorem 11** (Upper tail bounds: technical result).**
Given , let . Set , , and . Given , suppose that either
[TABLE]
holds, where the parameters and are defined in (9) and (27). Then we have
[TABLE]
Proof.
Let . We distinguish the following three cases: (i) , (ii) , and (iii) . Note that in all three cases we may assume , since decreasing yields a less restrictive assumption. Furthermore, in case (iii) we may also assume that holds (otherwise case (i) or (ii) apply). For concreteness, define
[TABLE]
(We remark that in cases (i)–(ii) the simpler choice suffices.) We defer the somewhat technical proofs of the following claims regarding Lemma 9: (a) assumption (17) holds, and (b) inequality (18) implies
[TABLE]
where here and below we shall again tacitly assume . Analogously to inequalities (21) and (24), by first applying Lemma 10 and Corollary 8, and then using , it follows that
[TABLE]
Since , this establishes inequality (28).
It remains to verify claims (a) and (b) above, and start with claim (a), i.e., that the assumption (17) of Lemma 9 holds. Note that . Furthermore, in case (i) we have , and in case (ii) we have and . So, in both cases, using we infer
[TABLE]
Proceeding analogously, in the cumbersome case (iii) it suffices to show . Using , and (20), it is routine to see that and . Assuming , by first using (15) and then distinguishing the cases (where ) and (where ), it follows that
[TABLE]
Assuming , we note that assumption implies (hence and thus , as noted above). Hence, by first using (15) and then the assumed lower bound on from case (iii), we infer
[TABLE]
Each time follows readily by definition of , establishing claim (a), as discussed above.
Finally, we verify claim (b), i.e., that inequality (18) implies estimate (29). We start by observing that if fails then a fortiori . Hence, using (18) with and , we deduce
[TABLE]
Analogously to (23), using inequality (18) and it also follows that
[TABLE]
We now use a fairly technical case distinction to verify that the two estimates (32)–(33) together imply (29). Assuming , analogously to the proof of (22) we have , so that
[TABLE]
Next we assume , which implies , so that
[TABLE]
In the remaining case and hold. Since implies , we infer . So, recalling that by (15) and that by (20), using , and we deduce that
[TABLE]
establishing . It follows that
[TABLE]
which together with inequalities (32)–(35) implies the claimed estimate (29). ∎
We now deduce the upper bounds of Theorem 2 and 5 from the upper tail inequality (28).
Proof of the upper bound in (4) of Theorem 2.
Let . For and it is routine to check that holds for sufficiently small. Hence Theorem 11 applies with , where by Remark 3. Using it follows that
[TABLE]
establishing the upper bound in (4). ∎
For Theorem 5 we shall simplify the form of the exponent in (28) via the following auxiliary result, writing instead of for typographic reasons (the assumption in (ii) is ad-hoc).
Lemma 12**.**
Given , the following holds whenever .
- (i)
If , then
[TABLE] 2. (ii)
If and , then implies
[TABLE] 3. (iii)
If and , then .
Proof.
Inequality (38) and the first two estimates of equation (39) follow immediately from (15) and , see Remark 3. We now turn to the final inequality of equation (39). By combining (15) and with and , using and , see (20), it follows similarly to (31) that
[TABLE]
where we exploited that calculus gives ; this completes the proof of claims (i)–(ii).
For claim (iii) we may of course assume (otherwise there is nothing to show). Hence implies by Remark 3, which in turn gives , because and together imply , completing the proof. ∎
Proof of the upper bound in (6) of Theorem 5.
Applying Theorem 11 (with ), using (i)–(ii) of Lemma 12 it follows that inequality (28) holds with in the exponent, where by (iii) of Lemma 12. Absorbing the factor similar to (37) then establishes the upper bound in (6). ∎
Proof of the upper bound in (7) of Theorem 5.
Since by Remark 3, note that the assumption
[TABLE]
implies , so that Theorem 11 (with ) applies. Using (40), by (iii) of Lemma 12 we also infer that . Absorbing the factor as before, it remains to show that the exponent of inequality (28) is . For this follows from (38) of Lemma 12 and (40). For this follows from (39) of Lemma 12, since (40) and imply and thus , as required. ∎
2.3 Straightforward extension to a certain sum of iid variables
We close this section by recording that minor (and in fact simpler) variants of our proofs also apply to the following sum of independent random variables:
[TABLE]
Indeed, in view of the structural similarities to the number of -armed stars in (which satisfies , writing for the degree of ), here we set , and define as the number of with . Now the proofs of Lemma 6 and 10 carry over with minor changes: exploiting that there are no dependencies between the , using a simple dyadic decomposition we here obtain
[TABLE]
For the proof of Corollary 8 it suffices to show that holds in the present setting. Since is a sum of independent indicators , we may write each as a sum of dependent indicators (which each are products of some distinct independent variables ). Using the constraint the analogous left hand side of (14) is thus bounded by , which in turn implies , as desired. Since the proof of Lemma 9 also remains valid (as inequality (19) carries over), we thus arrive at the following result.
Theorem 13** (Upper tail bounds: an extension).**
The upper bounds on the upper tail from Theorem 1, 2, 5, and 11 remain valid for the random variable defined in (41).
Perhaps surprisingly, we are not aware of any standard method or inequality (for sums of iid variables) which can routinely recover the upper tail bounds from Theorem 13. Here one technical difficulty seems to be that each summand has an upper tail that decays slower than exponentially (for ), which presumably is closely linked to the somewhat non-standard term in the exponent.
Acknowledgements. We would like to thank Svante Janson for a helpful discussion, and the CPC referee report from June 2015 (on an earlier version of this paper) for suggestions concerning the presentation.
Appendix A Appendix: Lower bounds on the upper tail
In this appendix we establish fairly routine lower bounds on the upper tail from Theorem 1, 2, and 5 (omitting some straightforward details). Following [31] we obtain our lower bounds via the following three events: that many copies of ‘cluster’ on few edges (see Lemma 14 and 16), that most copies of arise disjointly (see Lemma 15 and 17), and that contains more edges than expected (see Lemma 18).
A.1 Basic argument for Theorem 1
For Theorem 1 we shall use two different lower bounds, and the first one is based on the idea that relatively few edges (which ‘cluster’ in an appropriate way) can create fairly many stars . This is formalized by the following result, which implies since enforces .
Lemma 14** (Clustering).**
For every there is so that for all and there is with edges such that contains at least copies of .
Inspired by the proofs of Theorem 1.3 and 1.5 in [17], the idea is to use a complete bipartite graph with and , which contains edges and at least copies of (certain border cases require minor care).
Proof of Lemma 14.
Let , , and . If (i) and , then we let , with and . Note that exists, since it is easy to check that and , say (we leave the details to the reader). Furthermore, contains at least many , and edges.
If either (ii) or (iii) and , then we let , which trivially contains copies of , and edges.
Finally, if (iv) and , then we let , which contains at least vertex disjoint copies of and edges, completing the proof. ∎
The second lower bound is inspired by the fact that is approximately Poisson for small , in which case most arise disjointly. Indeed, the following standard result bounds from below by the probability that there are exactly vertex-disjoint copies of (see [7, 26, 31] for similar arguments), which for will imply ; the precise form of (42) will be useful later on.
Lemma 15** (Disjoint approximation).**
Given there are (depending only on ) such that, for all , and integers satisfying , we have
[TABLE]
Proof.
Let contain all copies of in . Define as the collection of all -element subsets of in which all stars are vertex disjoint. Given , define as the event that all stars of are present, and define as the event that none of the stars in are present. Note that
[TABLE]
Distinguishing the number of edges in which each star overlaps with some star from the vertex-disjoint collection , using Harris inequality [12] and we routinely obtain
[TABLE]
where . Furthermore, with , and in mind, basic counting (and a short calculation) gives
[TABLE]
This completes the proof of (42) since . ∎
Combining the above two results, we now prove the lower bound of Theorem 1.
Proof of the lower bound in (3) of Theorem 1.
We shall tacitly assume whenever necessary. Applying Lemma 14 with , there is with edges that contains at least copies of . If , then it follows that
[TABLE]
Otherwise , which by a short calculation implies , say (since implies and thus ). Applying Lemma 15 with , using , , and it follows that
[TABLE]
establishing the lower bound in (3). ∎
A.2 Refined arguments for Theorem 2 and 5
For Theorem 2 and 5 we shall refine the previous two lower bounds, and also introduce a new third lower bound. Each time some care is needed to obtain the ‘correct’ dependence on in the exponent, and we start by refining the ‘clustering’ based lower bound from Lemma 14 and (43).
Lemma 16** (Refined clustering bound).**
Given and there are (depending only on ) such that, for all , and satisfying , we have
[TABLE]
In case of the basic proof idea is to obtain copies of as follows: (i) we first use the clustering construction from Lemma 14 to plant copies of , and (ii) then use Harris’ inequality and a one-sided Chebychev’s inequality to show that typically of the remaining other copies of are present (the crux is that the expected number of such copies is , so having of them intuitively seems likely). For the resulting lower bound step (i) with probability thus ought to give the main contribution, making (45) plausible. For technical reasons, in the actual argument we have to plant copies of for carefully chosen . By mimicking the proof of Theorem 21 in [31] we then easily arrive at (45) above; we leave the details to the reader.
We next refine the ‘disjoint approximation’ based lower bound used in Lemma 15 and (44) for small . The idea is that inequality (42) intuitively relates to a binomial random variable with mean , which makes the following Chernoff-type bound for the upper tail plausible.
Lemma 17** (Disjoint approximation: Chernoff-type lower bound).**
Given there are (depending only on ) such that, for all , and satisfying , we have
[TABLE]
Noting the binomial-like form of inequality (42) it is routine to check that Lemma 15 indeed implies (46) above (e.g., by summing (42) as in the proof of Theorem 22 in [31]); we leave the details to the reader.
Our third lower bound for moderately large it is based on the idea that a deviation in the number of edges should typically entail a deviation in the number of copies (in concrete words: if has substantially more than edges, then we expect to have more copies than on average).
Lemma 18** (Deviation in number of edges: sub-Gaussian type lower bound).**
Given and there are (depending only on ) such that, setting , for all , and we have
[TABLE]
Remark 19**.**
By Remark 3, in inequality (47) we have , where .
Setting , the basic proof idea is to (i) condition on having edges, and (ii) then show that this conditioning converts into a typical event (the crux is that this conditioning drives up the expected value of ; to see this it might help to think of the uniform random graph with edges). For the resulting lower bound the conditioning thus ought to give the main contribution, which by folklore results satisfies {\mathbb{P}}(|E(G_{n,p})|\geq(1+\varepsilon)\binom{n}{2}p)=\exp\bigl{(}-\Theta_{\xi}(\varepsilon^{2}\binom{n}{2}p))\bigr{)}. This makes inequality (47) plausible, since for the considered range of . A simple modification of the proof of Theorem 24 in [31] makes this idea rigorous and establishes (47) above; we leave the details to the reader (we mention in passing that a tilting argument also works here).
Stitching the above three results together, we now prove the lower bounds of Theorem 2 and 5.
Proof of the lower bound in (7) of Theorem 5.
By (iii) of Lemma 12 we infer that , which in turn implies and thus . Hence an application of Lemma 16 (see inequality (45)) establishes the lower bound in (7). ∎
Proof of the lower bound in (6) of Theorem 5.
We shall only assume instead of . Applying Lemmas 16 and 18, and using Remark 19, it follows that there is such that
[TABLE]
By a virtually identical calculation as in the proof of (39) from Lemma 12, for it follows that holds. After adjusting the implicit constants, it follows that we can remove the indicator in inequality (48), which in view of establishes the lower bound in (6). ∎
Proof of the lower bound in (4) of Theorem 2.
Set and , as usual. Using (15) we have by assumption, so follows. In the following we shall distinguish the three cases (i) , (ii) , and (iii) .
In cases (i)–(ii) note that, say, holds. Using (i)–(ii) of Lemma 12, it thus suffices to prove the lower bound of (4) with exponent replaced by defined in (6). In case (i) this bound follows from the above proof (valid for ) of the lower bound in (6), and in case (ii) we shall now argue that this bound follows from inequality (45) of Lemma 16, by establishing that holds. Indeed, since and Remark 3 imply , after recalling and it then follows for sufficiently small (say, ) that
[TABLE]
completing the proof in cases (i)–(ii).
In the remaining case (iii) Lemmas 16 and 17 imply that, for some constant , we have
[TABLE]
We claim that for we have . Indeed, noting that for (which is easy to check by calculus), it follows that
[TABLE]
Furthermore, when , and when . In each case the claimed inequality holds, which allows omitting the indicator in (50). Since by Remark 3, now follows, which in view of completes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] B. Bhattacharya, S. Ganguly, E. Lubetzky, and Y. Zhao. Upper tails and independence polynomials in random graphs. Adv. Math. 319 (2017), 313–347.
- 2[2] B. Bollobás. Threshold functions for small subgraphs. Math. Proc. Cambridge Philos. Soc. 90 (1981), 197–206.
- 3[3] S. Chatterjee. The missing log in large deviations for triangle counts. Random Struct. Alg. 40 (2012), 437–451.
- 4[4] S. Chatterjee and A. Dembo. Nonlinear large deviations. Adv. Math. 299 (2016), 396–450.
- 5[5] S. Chatterjee and S.R.S. Varadhan. The large deviation principle for the Erdős-Rényi random graph. European J. Combin. 32 (2011), 1000–1017.
- 6[6] B. De Marco and J. Kahn. Upper tails for triangles. Random Struct. Alg. 40 (2012), 452–459.
- 7[7] B. De Marco and J. Kahn. Tight upper tail bounds for cliques. Random Struct. Alg. 41 (2012), 469–487.
- 8[8] A. Dembo and N. Cook. Large deviations of subgraph counts for sparse Erdős–Rényi graphs. Preprint (2018). ar Xiv:1809.11148 .
