Subgaussianity is hereditarily determined
Pandelis Dodos, Konstantinos Tyros

TL;DR
This paper demonstrates that the subgaussian properties of linear combinations of bounded random vectors are fundamentally linked to the subgaussian behavior of their random subsets, revealing a hereditary structure.
Contribution
It establishes that subgaussianity of a linear combination is essentially determined by the subgaussianity of its random subsets, highlighting a hereditary property.
Findings
Subgaussianity is hereditarily determined by random subsets.
The behavior of the entire sum is linked to its parts.
Subgaussian properties can be inferred from subset behaviors.
Abstract
Let be a positive integer, let be a random vector in with bounded entries, and let be a vector in . We show that the subgaussian behavior of the random variable is essentially determined by the subgaussian behavior of the random variables where is a random subset of .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Subgaussianity is hereditarily determined
Pandelis Dodos and Konstantinos Tyros
Department of Mathematics, University of Athens, Panepistimiopolis 157 84, Athens, Greece
Department of Mathematics, University of Athens, Panepistimiopolis 157 84, Athens, Greece
Abstract.
Let be a positive integer, let be a random vector in with bounded entries, and let be a vector in . We show that the subgaussian behavior of the random variable is essentially determined by the subgaussian behavior of the random variables where is a random subset of .
2010 Mathematics Subject Classification: 60E15, 60G99.
Key words: subgaussian random variable, subgaussian random vector, subvector.
1. Introduction
1.1. Subgaussianity
Recall that a real-valued random variable is called subgaussian if its tails are dominated by (that is, they decay at least as fast as) the tails of a gaussian. One of the several equivalent ways to quantify this property is using the Orlicz norm for the function . Specifically, the random variable is subgaussian if its Orlicz norm
[TABLE]
is finite.
Next, let be a positive integer, and let be a random vector in , that is, is a finite sequence of real-valued random variables defined on a common probability space. Also let and , and recall that the random vector is said to be -subgaussian at the direction provided that
[TABLE]
where
[TABLE]
is the inner product of and , and is the euclidean norm of the vector .
1.2. The problem
Let be a random vector with entries, and fix . For every subset of let denote the vector defined by
[TABLE]
In this paper we address the question whether the subgaussian behavior of the random vector at the direction is reflected to (and, conversely, whether it is characterized by) the typical subgaussian behavior of at the direction where is a random subset of distributed according to the uniform probability measure on or, more generally, according to the -biased measure111The -biased measure is defined by for every . (Here, and in the rest of this paper, we identify every with its indicator function .) ().
This question was motivated by a problem in density Ramsey theory; see Subsection 5.2 for more details. Related questions—though of a somewhat different nature—have been studied in high-dimensional probability and asymptotic convex geometry (see, e.g., [BN]), as well as in the study of thin sets in harmonic analysis (see [Pi]). It is important to note that the main point in our approach lies in the fact that, apart from the boundedness condition on , we make no further assumptions on the distributions of the random variables and on their correlation. (This level of generality is actually necessary for certain applications in combinatorics.)
1.3. Examples
At this point it is useful to give examples of bounded random vectors which are subgaussian at a given direction. For concreteness we will restrict our discussion to the direction , but corresponding examples can be given for any other direction.
Undoubtedly, the most important examples are random vectors with independent entries and, more generally, random vectors which are bounded martingale difference sequences. Another interesting class of examples consists of Sidon sets of characters in a compact abelian group . (Here, we view as a probability space equipped with the Haar probability measure, and we view every character as a complex-valued random variable on ; see [Pi] for details). Note, however, that all these examples are subgaussian at every direction.
A different—but quite relevant—example is a random vector whose entries exhibit high cancellation. More precisely, fix a -valued random variable . Assume for simplicity that is even, say , and fix a subset of with . We define by setting if , and if . Notice that , and so is -subgaussian at the direction for any . On the other hand, observe that ; consequently, if is -subgaussian at the direction , then . Nevertheless, it is easy to see that we may select, with positive probability, a subset of such that is -subgaussian at the direction .
All the above examples can be combined together by taking convex combinations. Precisely, let be a nonempty finite set, and for every let be a random vector in whose entries are either independent, or exhibit high cancellation in the sense we described above. If is any convex combination of , then clearly is at the direction , but it is already not quite straightforward to find a subset of with such that is -subgaussian at the direction .
1.4. The main result
Our main result shows that such a selection is possible in full generality. Specifically, we have the following theorem; more precise quantitative versions are given in Proposition 3.1 and Theorem 4.1 in the main text. (For our conventions for asymptotic notation see Subsection 2.2; recall that by we denote the -biased measure on .)
Theorem 1.1**.**
The following hold.
- (1)
Let , and let . Also let be a positive integer, let be a random vector in with entries, and let . If is at the direction , then for every
[TABLE]
Thus, the error term in (1.5) does not dependent on the dimension , the random vector , and the direction .** 2. (2)
Conversely, let , let , and let . Also let be a positive integer, let be a random vector in with entries, and let . If
[TABLE]
then is -subgaussian at the direction .
1.5. Sharpness of the probability
Although the lower bound in (1.5) is independent of the direction , we note that the probability appearing on the left-hand side of (1.5) does depend upon the choice of . Indeed, if , then this probability is . (See Corollary 4.11 in the main text.) At the other extreme, there exist random vectors and directions in for which the corresponding probability is at most for any fixed . (See Example 4.2.) In particular, the lower bound in (1.5) is optimal.
1.6. Related results/Outline of the argument
Beyond its probabilistic content, Theorem 1.1 can also be placed in the general context of property testing (see, e.g., [G]). Indeed, Theorem 1.1 essentially asserts that subgaussianity, at any given direction, is testable.
Theorem 1.1 can also be viewed as a partial unconditionality result, in the spirit of the work of Elton [E1, E2] and Pajor [Pa]. In fact, this is more than an analogy since part (1) of Theorem 1.1 for and the direction can be proved using the Sauer–Shelah lemma which is a main tool in the proof of the Elton–Pajor theorem.
That said, the proof of the general case of Theorem 1.1 is quite intrinsic and, apart from a couple of basic tools, it relies exclusively on properties of subgaussian random variables.
The first part is based on a large deviation inequality for the of the random variables which can be seen as a reverse triangle inequality; this is the content of Proposition 4.3 in the main text. With this inequality at our disposal, we detect the behavior of the probability in (1.5) using the -norm of the direction . Specifically, if and is sufficiently small, say , then we may select such that the corresponding probability is . On the other hand, if , then we fix a coordinate such that and we proceed by conditioning on the set of all such that .
Remark 1.2*.*
The argument is roughly analogous to the proof of Roth’s theorem [Ro]. Indeed, the case where the -norm is small corresponds to case of small Fourier bias and it implies pseudorandomness. On the other hand, the case where the -norm is non-negligible corresponds to the case of correlation with a character, and the proof takes advantage of this structural information.
The proof of the second part of Theorem 1.1 is quite simple, and it follows from a standard application of the bounded differences inequality.
1.7. Structure of the paper
We close this introduction by briefly discussing the contents of this paper. In Section 2, we fix our notation (which is mostly standard), and we recall some basic material which is needed for the proof of our main result. In Section 3 we give the proof of part (2) of Theorem 1.1, and in Section 4 we give the proof of part (1). Finally, in Section 5 we present and we comment on various extensions of Theorem 1.1.
Acknowledgments
We would like to thank the anonymous referee for carefully reading the paper and for several helpful suggestions.
2. Background material
2.1.
By we denote the set of all natural numbers. Recall that for every positive integer we set . Moreover, for every finite set by we denote its cardinality.
2.2.
We use the following and notation. If are parameters and is a positive real/integer, then we write to denote a quantity bounded in magnitude by where is a function which depends on and goes to zero as . Similarly, by we denote a quantity bounded in magnitude by where is a positive constant depending on the parameters .
2.3.
As we have mentioned, for every positive integer and every by we denote the -biased measure on , that is, the probability measure on which is defined by setting
[TABLE]
for every . In particular, is the uniform probability measure on .
2.4.
For every vector in and every by we shall denote the -norm of , that is, if , and .
2.5. Properties of subgaussian random variables
We will need the following properties of subgaussian random variables. For a proof, as well as for a detailed discussion of related material, see [V, Chapter 2].
Proposition 2.1**.**
Let be a real-valued random variable.
- (a)
If is subgaussian, then we have for every . 2. (b)
Conversely, let and assume that for every . Then, is subgaussian and, moreover, .
2.6. Hoeffding’s inequality and the bounded differences inequality
In various places in the paper, we will apply Hoeffding’s inequality and the bounded differences inequality. We will use these basic inequalities in a form which, although less general, is better suited to our needs. (The standard forms of these inequalities and their proofs can be found, e.g., in [BLM, Theorem 2.8] and [BLM, Theorem 6.2] respectively.)
Precisely, we will need the following consequence of Hoeffding’s inequality.
Proposition 2.2**.**
Let be a positive integer, and let . Also let . Then for any we have
[TABLE]
We will also need the following special case of the bounded differences inequality.
Proposition 2.3**.**
Let be a positive integer, let be a function, and let such that for every and every
[TABLE]
Also let . Then, setting , for any we have
[TABLE]
3. Proof of Theorem 1.1: part (2)
We have the following, more informative, version of part (2) of Theorem 1.1.
Proposition 3.1**.**
Let , let , let , and set
[TABLE]
Also let be a positive integer, let be a random vector in with for every , and let . If
[TABLE]
then is -subgaussian at the direction .
Remark 3.2*.*
We do not know which is the optimal dependence of the constant with respect to the parameters and . The referee noted that the dependence on could be improved; observe that the parameter is important in the sparse regime, that is, when .
Proposition 3.1 is based on two auxiliary results. The first one is an elementary identity which expresses the random variable as a linear combination of the random variables .
Fact 3.3**.**
Let be as in Proposition 3.1. Then we have
[TABLE]
In particular,
[TABLE]
Proof.
Observe that
[TABLE]
The estimate in (3.4) follows from this identity and the triangle inequality. ∎
The second auxiliary result is the following, fairly straightforward, consequence of the bounded differences inequality; we isolate this consequence for future use.
Lemma 3.4**.**
Let be as in Proposition 3.1. Then, setting
[TABLE]
for any we have
[TABLE]
Proof.
By the triangle inequality, for every and every we have
[TABLE]
Using this observation, the result follows from Proposition 2.3. ∎
We are now ready to proceed to the proof of Proposition 3.1.
Proof of Proposition 3.1.
Setting , by (3.6), we have
[TABLE]
Thus, by (3.2), we may select such that
, and 2.
.
Therefore, . By (3.4), (3.5) and the choice of in (3.1), we conclude that , as desired. ∎
4. Proof of Theorem 1.1: part (1)
4.1.
This section is devoted to the proof of the following theorem.
Theorem 4.1**.**
Let , let , let , and set
[TABLE]
Also let be a positive integer, let be a random vector in with entries, and let . If is at the direction , then
[TABLE]
It is clear that Theorem 4.1 yields part (1) of Theorem 1.1. As we have already pointed out in the introduction, the lower bound in (4.2) is optimal.
Example 4.2*.*
Let be an arbitrary positive integer, and set
[TABLE]
We fix a -valued random variable and, as in Subsection 1.3, we define the (high cancellation) random vector in by setting , and if . Since , the random vector is at the direction for any . Next, let be arbitrary, and set
[TABLE]
By Proposition 2.2, we see that . Moreover, if , then and, therefore, if is any positive real such that is at the direction , then . Thus, we conclude that for any ,
[TABLE]
4.2. A large deviation inequality for the -norm
The first step of the proof of Theorem 4.1 is the following large deviation inequality.
Proposition 4.3**.**
Let , and let . Also let be as in Theorem 4.1. If is at the direction , then for any we have
[TABLE]
In order to put Proposition 4.3 in a proper context recall that, by (3.3) and the triangle inequality, we have . The next corollary shows that this estimate can actually be reversed. Thus, we may view Proposition 4.3 as a reverse triangle inequality.
Corollary 4.4**.**
Let , and let . Also let be as in Theorem 4.1. If is at the direction , then
[TABLE]
In particular, if , then
[TABLE]
Proof.
It is a straightforward consequence of Proposition 4.3. Indeed,
[TABLE]
as desired. ∎
Corollary 4.4 can be used, in turn, to upgrade Proposition 4.3 and provide finer information for the distribution of the -norm of the random variables . Specifically, we have the following corollary; it follows immediately by Lemma 3.4, Corollary 4.4, and taking into account the fact that for every -valued random variable .
Corollary 4.5**.**
Let , and let . Also let be as in Theorem 4.1. If is at the direction , then for any
[TABLE]
4.3. Proof of Proposition 4.3
It is based on the following lemma.
Lemma 4.6**.**
Let , and let . Also let be as in Theorem 4.1. If is at the direction , then, setting , for every we have
[TABLE]
It is easy to see that Proposition 4.3 follows from Lemma 4.6. Indeed, let be arbitrary, and set . It is easy to see that with this choice we have that . Noticing that , by Lemma 4.6, we conclude that (4.5) is satisfied.
Thus, it is enough to prove Lemma 4.6. To this end, we need the following sublemma.
Sublemma 4.7**.**
Let be a real-valued random variable, let , and assume that for every . Then we have
[TABLE]
Proof.
Set . By Proposition 2.1, it suffices to show that for every we have .
Indeed, notice first that, since , we have . This, in turn, implies that if .
The remaining cases (that is, when ) follow from our hypothesis and a standard dyadic pigeonholing. Specifically, for every set and observe that
[TABLE]
Let be arbitrary and let be such that . Then we have
[TABLE]
and the proof is completed. ∎
We are ready to proceed to the proof of Lemma 4.6.
Proof of Lemma 4.6.
The left-hand side of (4.9) is scale-invariant; thus we may assume that , and it is enough to prove that
[TABLE]
for every .
Step 1. We will show that for every we have
[TABLE]
Fix . Let denote the underlying probability space. Let be arbitrary; since and , by Proposition 2.2, we have
[TABLE]
(We note that here is the only place in the argument where the boundedness of the random vector is used.) Next, observe that the event
[TABLE]
contains the event
[TABLE]
Finally, notice that since and is -subgaussian at the direction . Thus, by Proposition 2.1 applied to the fixed , we have
[TABLE]
Let denote the product probability measure of and . Then using: (i) the estimates in (4.13) and (4.16), (ii) the inclusion of the events in (4.14) and (4.15), (iii) the choice of the constant , and (iv) Fubini’s theorem, we obtain that
[TABLE]
or, equivalently,
[TABLE]
By (4.18) and Markov’s inequality, we conclude that
[TABLE]
which is clearly equivalent to (4.12).
Step 2. We will estimate the probability in (4.11) using a discretization argument, (4.12) and Sublemma 4.7. We proceed to the details.
Let be arbitrary. For every set
[TABLE]
and observe that, by (4.12), we have \mu_{p}(\mathcal{C}_{M}^{j})\geqslant 1-2\exp\Big{(}-\frac{2^{2j}M^{2}}{2Q^{2}}\Big{)}. Therefore, setting
[TABLE]
we have
[TABLE]
where the last inequality holds true since . Moreover, for every , by Sublemma 4.7 applied for “”, “” and “” and using again the fact that , we see that . This shows that (4.11) is satisfied, and the proof of Lemma 4.6 is completed. ∎
4.4. The main dichotomy
The next, and last, step of the proof of Theorem 4.1 is the following proposition which relates the probability on the left-hand side of (4.2) with the -norm of the direction . In particular, this probability gets bigger as gets smaller.
Proposition 4.8**.**
Let , and let . Also let be as in Theorem 4.1. Assume that and that is at the direction . Finally, let . Then, for every , the following hold.
- (i)
If , then
[TABLE] 2. (ii)
If , then
[TABLE]
Remark 4.9*.*
Note that the lower bound in (4.23) depends upon the choice of (thus, it is not uniform) but this is offset by making the subgaussianity constant of at the direction independent of . In (4.24), this phenomenon is reversed.
Remark 4.10*.*
The dependence on in (4.23) is tight up to a logarithmic factor. This can be seen by considering the diagonal direction of a random vector whose entries are truncated independent exponential random variables. We are grateful to the referee for pointing this out.
Proof of Proposition 4.8.
Fix , and set
[TABLE]
Since and is at the direction , by Corollary 4.5, we have
[TABLE]
Also write .
Part (i): Assume that , and set
[TABLE]
Notice that for every we have , that is, the random vector is \big{(}\sqrt{2/p}\,(12+\lambda)K\big{)}-subgaussian at the direction . Also observe that
[TABLE]
Thus, by Proposition 2.2 applied for the vector “” and “”, we obtain that
[TABLE]
Combining (4.26) and (4.29), we see that (4.23) is satisfied.
Part (ii): Now assume that . Fix such that , and set
[TABLE]
Observe that for every we have . Consequently, for every the random vector is \big{(}(12+\lambda)K\alpha^{-1}\big{)}-subgaussian at the direction . Since , the result follows. ∎
We close this subsection with the following consequence of Proposition 4.8 which complements Example 4.2 and concerns the behavior of the probability in (4.2) for the “flat” vector .
Corollary 4.11**.**
Let , and let . Also let be as in Theorem 4.1, and set . If is at the direction , then for every we have
[TABLE]
Proof.
It follows by part (i) of Proposition 4.8 applied to the vector “” (notice that ), the constant “” and “ ”. ∎
4.5. Proof of Theorem 4.1
The result follows by applying Proposition 4.8 for
[TABLE]
and observing that
[TABLE]
by (4.32) and the choice of in (4.1). Indeed, clearly we may assume that . Therefore, if , then, by (4.23) and the previous observation,
[TABLE]
while if , then, by (4.24),
[TABLE]
Remark 4.12*.*
Note that the lower bound in (4.2) can be proved without invoking Proposition 4.8. Indeed, one can proceed using Corollary 4.5, the elementary identity
[TABLE]
and Markov’s inequality. However, this approach yields a weaker estimate for the constant in (4.1) and, more importantly, it provides no information on the behavior of the probability appearing on the left-hand side of (4.2).
5. Comments
5.1. Extension to non-linear functions
Beyond the class of linear functions, Theorem 1.1 can be extended to certain chaoses which have a natural combinatorial interpretation: they are the homomorphism densities associated with weighted uniform hypergraphs (see, e.g., [L, Chapter 7]). Of course, in order to be meaningful such an extension, one has to select an appropriate normalization. We will adopt the scaling which appears in the bounded differences inequality222This choice is not optimal for certain classes of functions, but it appears to be the right choice at this level of generality..
5.1.1.
Specifically, let be a positive integer, and let be a bounded measurable function. For every set
[TABLE]
and define
[TABLE]
Notice that: (i) the quantity is a semi-norm, (ii) for every , (iii) if and only if the function is constant, and (iv) if is linear, that is, , then .
5.1.2.
Next, let be a random vector in with -valued entries. Given , we say that is -subgaussian with respect to if
[TABLE]
Observe that if is linear, then this is equivalent to saying that is -subgaussian at the direction . Also note that if the random vector has independent entries, then the bounded differences inequality yields that is with respect to .
5.1.3.
It is also straightforward to extend (1.4). Precisely, for every subset of let denote the function defined by
[TABLE]
where with if , and otherwise.
Thus, the non-linear version of the question discussed in the introduction is whether the subgaussian behavior of the random vector with respect to the function is reflected to/characterized by the typical subgaussian behavior of with respect to where is random subset of .
5.1.4.
It is likely that this problem is rather delicate. As we have mentioned, we will consider the case where the function is the homomorphism density associated with a weighted uniform hypergraph.
More precisely, let be a positive integer. For every integer and every by we denote the set of all subsets of of cardinality . Let be a weighted hypergraph, that is, is a map which assigns to every hyperedge a weight . The homomorphism density function associated with is the map defined by
[TABLE]
Note that if is a subset of , then the restriction of defined in (5.4) is naturally identified with the homomorphism density function associated with the induced on sub-hypergraph of .
5.1.5.
We have the following theorem.
Theorem 5.1**.**
The following hold.
- (1)
Let , let , and let be a positive integer. Also let be an integer, let be a random vector in with entries, and let be a weighted -uniform hypergraph on . If is with respect to , then for every
[TABLE] 2. (2)
Conversely, let , let , let , and let be a positive integer. Also let be an integer, let be a random vector in with entries, and let be a weighted hypergraph on . If
[TABLE]
then is -subgaussian with respect to .
The proof of Theorem 5.1 is similar to the proof of Theorem 1.1; for the convenience of the reader we present the details in the Appendix.
We also note that the lower bound in (5.6) is optimal. Specifically, we have the following analogue of Example 4.2.
Example 5.2*.*
Fix a positive integer , and let be an arbitrary integer. We define a weighted -uniform hypergraph on by the rule
[TABLE]
Also fix a -valued random variable , and let be the random vector in defined by setting for every . Observe that , and so is with respect to for any . Next, let be arbitrary, and set
[TABLE]
By Proposition 2.2, we see that . Fix and set . Since , we have
[TABLE]
On the other hand, note that
if (this is because ), 2.
if , and 3.
if
which implies that . Therefore, if is any positive real such that is with respect to , then
[TABLE]
Thus, for any we have
[TABLE]
5.2. Extension to partially subgaussian random vectors
Let be a positive integer, and let be a random vector in . Given and , we say333This terminology is not standard. that is -partially subgaussian at the direction provided that
[TABLE]
Notice that if , then this is equivalent to saying that the random vector is -subgaussian at the direction . Thus, this notion is of interest when is significantly larger than . Examples of random vectors which are partially subgaussian with parameters in this regime appear frequently in combinatorics, most notably in various density increment strategies. Specifically, one encounters random vectors in which are -partially subgaussian at the direction with and where is a very small constant; see [DK, Part 2]. The understanding of the statistical/concentration properties of these examples was the starting point of the present paper.
5.2.1.
It is not hard to see that Theorem 1.1 can be extended to -partially subgaussian random vectors, but of course one is also interested in determining the quantitative dependence on the parameter . In this direction we have the following analogue of Proposition 4.8.
Proposition 5.3**.**
Let , let , and let . Also let be a positive integer, let be a random vector in with entries, and let with . Assume that is subgaussian at the direction . Finally, let . Then the following hold.
- (i)
If , then
[TABLE] 2. (ii)
If , then
[TABLE]
In particular, Proposition 5.3 yields that if , for some , , and the random vector is subgaussian at the direction , then the probability on the left-hand side of (5.8) is at least
[TABLE]
that is, we have an exponential improvement upon (4.31).
5.2.2.
Not surprisingly, the proof of Proposition 5.3 follows the lines of the proof of Proposition 4.8. The only difference is that, instead of Corollary 4.5, it uses a straightforward variant of Lemma 4.6 for partially subgaussian random vectors. (In particular, the exponential gain in (5.10) comes from the fact that we need to control the tails up to .) We leave the details to the interested reader.
5.3. Extension to not necessarily bounded random vectors
It is open to us whether part (1) of Theorem 1.1 can be extended to random vectors with subgaussian, but not necessarily bounded, entries. Although the boundedness of is used only in (4.13), the strategy of our proof uses this property in an essential way and it cannot be dropped by merely optimizing the argument.
Appendix A Proof of Theorem 5.1
A.1. Preliminary tools
We begin by observing the following two simple facts; they will be used in the proofs of both parts of Theorem 5.1.
Fact A.1**.**
Let , let be positive integers with , and let be a weighted -uniform hypergraph on . Then, for any we have
[TABLE]
In particular, if is a random vector in with -valued entries, then
[TABLE]
Proof.
Write and notice that
[TABLE]
The estimate in (A.2) follows from (A.1) and the triangle inequality. ∎
Fact A.2**.**
Let be a positive integer, let be a random vector in with entries, and let be a bounded measurable function. Define by setting for every , where is as in (5.4). Then we have
[TABLE]
Proof.
The desired estimate is a consequence of the fact that for every bounded random variable we have
[TABLE]
Indeed, fix and , and observe that
[TABLE]
Thus, we have for every . This, in turn, implies inequality (A.3). ∎
A.2. Proof of part (2)
Let be as in part (2) of Theorem 5.1, and set
[TABLE]
We will show that if
[TABLE]
then is -subgaussian with respect to . To this end we need the following lemma.
Lemma A.3**.**
Let be as in part (2) of Theorem 5.1. Then, setting
[TABLE]
for any we have
[TABLE]
Proof.
Define by setting for every , and observe that . By Fact A.2 applied for the function “”, we see that . Hence, by Proposition 2.3, for any we have
[TABLE]
as desired. ∎
Now set , and let be as in (A.7). By (A.6) and Lemma A.3, there exists such that
, and 2.
.
(The last inequality follows from the definition of the semi-norm and (5.4).) Using these estimates, the result follows by (A.2) and the choice of in (A.5).
A.3. Proof of part (1)
The proof of this part is more involved. As we have already noted, the argument is similar to that of the proof of Theorem 4.1.
A.3.1. A large deviation inequality
The first step is the following analogue of Proposition 4.3.
Proposition A.4**.**
Let , and let . Also let be as in part (1) of Theorem 5.1. If is with respect to , then for any ,
[TABLE]
Proof.
Note that, arguing as in Subsection 4.3, it is enough to show the following.
*Let , and let . Also let be as in part (1) of Theorem 5.1. If is with respect to , then, setting , for every we have *
[TABLE]
The left-hand side of (A.10) is scale-invariant, and so we may assume that the weighted hypergraph satisfies . Thus, it is enough to prove that
[TABLE]
for every .
As in Lemma 4.6, we start by showing that for any we have
[TABLE]
Fix and let denote the underlying probability space. Let be arbitrary, and recall that . We define the map by setting \zeta(H)=\hom_{\mathcal{W}[H]}\big{(}\bm{X}(\omega)\big{)} for every ; observe that for every . Since , by Proposition 2.3 and identity (A.1),
[TABLE]
(Note that (A.13) is the analogue of (4.13). We point out that this is, essentially, the only step of the proof which differs from that of Proposition 4.3.) Also observe that the event
[TABLE]
contains the event
[TABLE]
On the other hand, we have since and the random vector is -subgaussian with respect to . Thus, by Proposition 2.1,
[TABLE]
Denoting by the product probability measure of and , the previous discussion yields that
[TABLE]
The estimate in (A.12) now follows from (A.17) and Markov’s inequality.
With inequality (A.12) at our disposal, we will estimate the probability in (A.11) using Sublemma 4.7. Precisely, fix , and for every set
[TABLE]
Also set
[TABLE]
By (A.12), we have \mu_{p}(\mathcal{C}_{M}^{j})\geqslant 1-2\exp\Big{(}-\frac{2^{2j}M^{2}}{2Q^{2}}\Big{)} for every . This estimate and the fact that are easily seen to imply that
[TABLE]
For every , by Sublemma 4.7 applied for the random variable “”, “” and “” and using again the fact that , we obtain that . That is, (A.11) is satisfied, as desired. ∎
A.3.2. Consequences
We will need two consequences of Proposition A.4. The first one is the analogue of Corollary 4.4; its proof is identical to that of Corollary 4.4.
Corollary A.5**.**
Let , and let . Also let be as in part (1) of Theorem 5.1. If is with respect to , then
[TABLE]
The second corollary is the analogue of Corollary 4.5.
Corollary A.6**.**
Let , and let . Also let be as in part (1) of Theorem 5.1. If is with respect to , then for any ,
[TABLE]
Proof.
As in the proof of Lemma A.3, define the function by setting for every . Recall that, by Fact A.2, we have
[TABLE]
Using Corollary A.5 and (A.23), the result follows by applying Proposition 2.3 to the function and the vector “\mathbf{c}=\big{(}\Delta_{1}(g),\dots,\Delta_{n}(g)\big{)}”. ∎
A.3.3. Completion of the proof
Notice that part (1) of Theorem 5.1 follows from the following, more informative, theorem.
Theorem A.7**.**
Let be as in part (1) of Theorem 5.1. Also let , and set
[TABLE]
If is -subgaussian with respect to , then
[TABLE]
Proof.
Set
[TABLE]
and observe that . Also set
[TABLE]
and
[TABLE]
By Corollary A.6, we have
[TABLE]
On the other hand, by identity (A.1), the fact that is a semi-norm, and the triangle inequality, we have . Moreover, notice that for every . Using these observations, we obtain that
[TABLE]
Therefore, by (A.29) and (A.30), we see that
[TABLE]
Finally observe that, by the choice of in (A.24), for every we have . The proof is completed. ∎
Remark A.8*.*
We note that it is also possible to obtain a partial extension of part (i) of Proposition 4.8. More precisely, if is the complete -uniform hypergraph on vertices—that is, if for every —or, more generally, if the weighted hypergraph is sufficiently pseudorandom444The notion of pseudorandomness which is needed in our setting is the following requirement: for every , if is sufficiently large (depending only on ), then for every and every with we have
, then the probability on the left-hand side of (5.6) is .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[BN] S. G. Bobkov, and F. L. Nazarov, Large deviations of typical linear functionals on a convex body with unconditional basis , in “Stochastic inequalities and applications”, Progress in Probability, Vol. 56, Birkhäuser, Basel, 2003, 3–13.
- 2[BLM] S. Boucheron, G. Lugosi and P. Massart, Concentration inequalities. A nonasymptotic theory of independence , Oxford University Press, 2013.
- 3[DK] P. Dodos and V. Kanellopoulos, Ramsey Theory for Product Spaces , Mathematical Surveys and Monographs, Vol. 212, American Mathematical Society, 2016.
- 4[E 1] J. Elton, Weakly null normalized sequences in Banach spaces , Ph.D. Thesis, Yale University, 1978.
- 5[E 2] J. Elton, Sign-embeddings of ℓ 1 n superscript subscript ℓ 1 𝑛 \ell_{1}^{n} , Trans. Amer. Math. Soc. 279 (1983), 113–124.
- 6[G] O. Goldreich (editor), Property Testing: Current Research and Surveys , Lecture Notes in Computer Science, Vol. 6390, Springer, 2010.
- 7[L] L. Lovász, Large Networks and Graph Limits , American Mathematical Society Colloquium Publications, Vol. 60, American Mathematical Society, 2012.
- 8[Pa] A. Pajor, Sous espaces ℓ 1 n subscript superscript ℓ 𝑛 1 \ell^{n}_{1} des espaces de Banach , Travaux en cours 16, Herman, Paris, 1985.
